-J 



-67- 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: (Other 
{US 

NICOLA, Alison FARLEY, 



Chan US) AMRAD OPERATIONS PTY LTD 
>nly) Douglas James HILTON, Nicos Antony 
Tracey WILLSON, Jian-Guo ZHANG, Warren 
ALEXANDER, Steven RAKAR, Louis FABRI, Tetsuo XOJIMA, Masatsugu MAEDA, 
Yasumfumi XIKUCHI, Andrew HASH 

(ii) TITLE OF INVENTION : A NOVEL HAEMPOIETIN RECEPTOR AND GENETIC 

SEQUENCES ENCODING SAME 

(iii) NUMBER OF SEQUENCES : 54 

{iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SCJLLY, SCOTT, MURPHY & PRESSOR 

(B) STREET: 400 GARDEN CITY PLAZA 

(C) CITY: GARDEN ZITY 

(D) STATE: NEW YO OC 

(E) COUNTRY: UNIT 3D STATES OF AMERICA 

(F) ZIP: 11530-02 19 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: ]' loppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSVEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patcntln Release #1.0, Version #1.25 

(vii) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: CIP APPLICATION OF USSN 08/928,720 

(B) FILING DATE:l(j-MAR-1998 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NttoBER: PCT/GB97/02479 

(B) FILING DATE: j l-SEP-1997 



(vii) PRIOR APPLICATION 



(A) APPLICATION NTMBER: US APPLICATION NO. 08/928,720 

(B) FILING DATE: |l-S£?-1997 

(vii) PRIOR APPLICATION 



DATA: 

(A) APPLICATION NUMBER: P0224S/96 

(B) FILING DATE: 



(viii) ATTORNEY / AGENT INFC iRMATION : 



(A) NAME: DIGIGLIC 



(B) REGISTRATION lOz 31,346 



(C) REFERENCE /DOCI 



(ix) TELECOMMUNICATION INFORMATION: 



(A) TELEPHONE: +531 

(B) TELEFAX: +516 



DATA: 



l-SEP-1996 



FRANK S 



ET NUMBER: 10857Z 



6 742 4343 
742 4366 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 



Trp Ser Xaa Trp Ser 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: base pairs 
£B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 2: 



ACTCGCTCCA GATTCCCGCC TTTT 



(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



TCCCGCCTTT TTCGACCCAT AGAT 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GGTACTTGGC TTGGAAGAGG AAAT 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 
CGGCTCACGT GCACGTCGGG TGGG 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 
AGCTGCTGTT AAAGGGCTTC TC 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 



(A/G)CTCCA(A/G)TC(A/G) CTCCA 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



(A/G)CTCCA(C/T)TC(A/G) CTCCA 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE : DNA . 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
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AAGTGTGACC ATCATGTGGA C 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GGAGGTGTTA AGGAGGCG 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ATGCCCGCGG GTCGCCCG 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1506 bass pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .1242 

(Xi) SEQUENCE DESCRIPTION; SEQ ID NO: 12: 

GGCACGAGCT TCGCTGTCCG CGCCCAGTGA CGCGCGTGCG GACCCGAGCC CCAATCTGCA 
CCCCGCAGAC TCGCCCCCGC CCCATACCGG CGTTGCAGTC ACCGCCCGTT GCGCGCCACC 
CCC 



ATG CCC GCG GGT CGC CCG GGC CCC GTC GCC CAA TCC GCG CGG CGG CCG 
Met Pro Ala Gly Arg Pro Gly. Pro Val Ala Gin Ser Ala Arg Arg Pro 
•1 5 '10 15 

CCG CGG CCG CTG TCC TCG CTG TGG TCG CCT CTG TTG CTC TGT GTC CTC 
Pro Arg Pro Leu Ser S r Leu Trp Ser Pro Leu Leu Leu Cys Val Leu 
20 25 30 
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GGG GTG CCT CGG GGC GGA TCG GGA GCC CAC ACA GCT GTA ATC AGO CCC 144 
Gly Val Pro Arg Gly Gly Ser Gly Ala His Thr Ala Val He Ser Pro 
35 40 45 

CAG GAC CCC ACC CTT CTC ATC GGC TCC TCC CTG CAA GCT ACC TGC TCT 192 
Gin Asp Pro Thr Leu Leu He Gly Ser Ser Leu Gin Ala Thr Cys Ser 
50 55 60 

ATA CAT GGA GAC ACA CCT GGG GCC ACC GCT GAG GGG CTC TAC TGG ACC 240 
He His Gly Asp Thr Pro Gly Ala Thr Ala Glu Gly Leu Tyr Trp Thr 
65 70 75 80 

CTC AAT GGT CGC CGC CTG CCC TCT GAG CTG TCC CGC CTC CTT AAC ACC 288 
Leu Asn Gly Arg Arg Leu Pro Ser Glu Leu Ser Arg Leu Leu Asn Thr 
85 90 95 

TCC ACC CTG GCC CTG GCC CTG GCT AAC CTT AAT GGG TCC AGG CAG CAG 33 6 

Ser Thr Leu Ala Leu Ala Leu Ala Asn Leu Asn Gly Ser Arg Gin Gin 
100 105 110 

TCA GGA GAC AAT CTG GTG TGT CAC GCC CGA GAC GGC AGC ATT CTG GCT 384 
S©r Gly Asp Asn Leu Val Cys His Ala Arg Asp Gly Ser He Leu Ala 
115 120 125 

GGC TCC TGC CTC TAT GTT GGC TTG CCC CCT GAG AAG CCC TTT AAC ATC 432 
Gly Ser Cys Leu Tyr Val Gly Leu Pro Pro Glu Lys Pro Phe Asn He 
130 135 140 

AGC TGC TGG TCC CGG AAC ATG AAG GAT CTC ACG TGC CGC TGG ACA CCG 480 
Ser Cys Trp Ser Arg Asn Met Lys Asp Leu Thr Cys Arg Trp Thr Pro 
145 150 155 160 

GGT GCA CAC GGG GAG ACA TTC TTA CAT ACC AAC TAC TCC CTC AAG TAC 528 
Gly Ala His Gly Glu Thr Phe Leu His Thr Asn Tyr Ser Leu Lys Tyr 
165 170 175 

AAG CTG AGG TGG TAC GGT CAG GAT AAC ACA TGT GAG GAG TAC CAC ACT 576 
Lys Leu Arg Trp Tyr Gly Gin Asp Asn Thr Cys Glu Glu Tyr His Thr 
180 185 190 

GTG GGC CCT CAC TCA TGC CAT ATC CCC AAG GAC CTG GCC CTC TTC ACT 624 
Val Gly Pro His Ser Cye His He Pro Lys Asp Leu Ala Leu Phe Thr 
195 200 205 

CCC TAT GAG ATC TGG GTG GAA GCC ACC AAT CGC CTA GGC TCA GCA AGA 672 
Pro Tyr Glu He Trp val Glu Ala Thr Asn Arg Leu Gly Ser Ala Arg 
210 215 220 

TCT GAT GTC CTC ACA CTG GAT GTC CTG GAC GTG GTG ACC ACG GAC CCC 720 
Ser Asp Val Leu Thr Leu Asp Val Leu Asp Val Val Thr Thr Asp Pro 
225 230 235 240 

CCA CCC GAC GTG CAC GTG AGC CGC GTT GGG GGC CTG GAG GAC CAG CTG 768 
Pro Pro Asp Val His Val Ser Arg val Gly Gly Leu Glu Asp Gin Leu 
245 250 255 

AGT GTG CGC TGG GTC TCA CCA CCA GCT CTC AAG GAT TTC CTC TTC CAA 816 
Ser Val Arg Trp Val Ser Pro Pro Ala Leu Lys Asp Phe Leu Phe Gin 
260 265 270 

GCC AAG TAC CAG ATC CGC TAC CGC GTG GAG GAC AGC GTG GAC TGG AAG 864 
Ala Lys Tyr Gin He Arg Tyr Arg Val Glu Asp Ser Val Asp Trp Lys 
275 280 285 

GTG GTG GAT GAC GTC AGC AAC CAG ACC TCC TGC CGT CTC GCG GGC CTG 912 
Val Val Asp Asp val Ser Asn Gin Thr Ser Cys Arg Leu Ala Gly Leu 
290 295 300 

AAG CCC GGC ACC GTT TAC TTC GTC CAA GTG CGT TGT AAC CCA TTC GGG 960 
Lys Pro Gly Thr Val Tyr Phe Val Gin Val Arg Cys Asn Pro Phe Gly 
305 310 315 320 

ATC TAT GGG <TCG AAA AAG GCG GGA ATC TGG AGC GAG TGG AGC CAC CCC 1008 
He Tyr Gly Ser Lys Lys Ala Gly He Trp Ser Glu Trp Ser His Pro 
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325 330 335 

ACC GOT GCC TCC ACC CCT CGA AGT GAG CGC CCG GGC CCG GGC GGC GGG 1056 

Thr Ala Ala Ser Thr Pro Arg Ser Glu Arg Pro Gly Pro Gly Gly Gly 
340 345 350 

GTG TGC GAG CCG CGG GGC GGC GAG CCC AGC TCG GGC CCG GTG CGG CGC 1104 
Val Cys Glu Pro Arg Gly Gly Glu Pro Ser Ser Gly Pro Val Arg Arg 
355 360 365 

GAG CTC AAG CAG TTC CTC GGC TCG CTC AAG AAG CAC GCA TAC TGC TCG 1152 
Glu Leu Lys Gin Phe Leu Gly Trp Leu Lys Lys His Ala Tyr Cys Ser 
370 375 380 

AAC CTT AGT TTC CGC CTG TAC GAC CAG TGG CGT GCT TGG ATG CAG AAG 1200 
Asn Leu Ser Phe Arg Leu Tyr Asp Gin Trp Arg Ala Trp Met Gin Lys 
385 390 395 400 

TCA CAC AAG ACC CGA AAC CAG GTC CTG CCG GCT AAA CTC TAAGGATAGG 1249 
Ser Hie Lys Thr Arg Asn Gin Val Leu Pro Ala Lys Leu 
405 410 

CCATCCTCCT GCTGGGTCAG ACCTGGAGGC TCACCTGAAT TGGAGCCCCT CTGTACCATC 1309 

TGGGCAACAA AGAAACCTAC CAGAGGCTGG GGCACAATGA GCTCCCACAA CCACAGCTTT 13 69 

GGTCCACATG ATGGTCACAC TTGGATATAC CCCAGTGTGG GTAAGGTTGG GGTATTGCAG 1429 

GGCCTCCCAA CAATCTCTTT AAATAAATAA AGGAGTTGTT CAGGTAAAAA AAAAAAAAAA 1489 

AAAAAAAAAA AAAAAAA 1506 

(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 413 aniino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Pro Ala Gly Arg Pro Gly Pro val Ala Gin Ser Ala Arg Arg Pro 
1 5 10 15 

Pro Arg Pro Leu Ser Ser Leu Trp Ser Pro Leu Leu Leu Cys Val Leu 
20 25 30 

Gly Val Pro Arg Gly Gly Ser Gly Ala His Thr Ala Val He Ser Pro 
35 40 45 

Gin Asp Pro Thr Leu Leu lie Gly Ser Ser Leu Gin Ala Thr Cys Ser 
50 55 60 

He His Gly Asp Thr Pro Gly Ala Thr Ala Glu Gly Leu Tyr Trp Thr 
65 70 75 80 

Leu Asn Gly Arg Arg Leu Pro Ser Glu Leu Ser Arg Leu Leu Asn Thr 
85 90 95 

Ser Thr Leu Ala Leu Ala Leu Ala Asn Leu Asn Gly Ser Arg Gin Gin 
100 105 HO 

Ser Gly Asp Asn Leu Val Cys His Ala Arg Asp Gly Ser He Leu Ala 
115 120 125 

Gly Ser Cys Leu Tyr Val Gly Leu Pro Pro Glu Lys Pro Phe Aen lie 
130 135 140 

Ser cys Trp Ser Arg Asn Met Lys Asp Leu Thr Cys Arg Trp Thr Pro 
145 * 150 155 160 
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Gly Ala His Gly Gl\i Thr Phe Leu His Thr Asn Tyr Ser Leu Lys Tyr 
165 170 1 7 5 

Lys Leu Arg Trp Tyr Gly Gin Asp Asn Thr Cys Glu Glu Tyr His Thr 
180 185 190 

Val Gly Pro His Ser Cys His lie Pro Lys Asp Leu Ala Leu Phe Thr 
195 200 205 

Pro Tyr Glu lie Trp Val Glu Ala Thr Asn Arg Leu Gly Ser Ala Arg 
210 215 220 

Ser Asp Val Leu Thr Leu Asp Val Leu Asp val val Thr Thr Asp Pro 
225 230 235 240 

Pro Pro Asp Val His Val Ser Arg Val Gly Gly Leu Glu Asp Gin Leu 
245 250 255 

Ser Val Arg Trp Val Ser Pro Pro Ala Leu Lys Asp Phe Leu Phe Gin 
260 265 270 

Ala Lys Tyr Gin He Arg Tyr Arg val Glu Asp Ser Val Asp Trp Lys 
275 280 285 

Val Val Asp Asp Val Ser Asn Gin Thr Ser Cys Arg Leu Ala Gly Leu 
290 295 300 

Lye Pro Gly Thr Val Tyr Phe val Gin val Arg Cys Asn Pro Phe Gly 
305 310 315 320 

lie Tyr Gly Ser Lys Lys Ala Gly He Trp Ser Glu Trp Ser His Pro 
325 330 335 

Thr Ala Ala Ser Thr Pro Arg Ser Glu Arg Pro Gly Pro Gly Gly Gly 
340 345 350 

Val Cys Glu Pro Arg Gly Gly Glu Pro Ser Ser Gly Pro val Arg Arg 
355 360 . 365 

Glu Leu Lys Gin Phe Leu Gly Trp Leu Lys Lys His Ala Tyr Cys Ser 
370 375 380 

Asn Leu Ser Phe Arg Leu Tyr Asp Gin Trp Arg Ala Trp Met Gin Lys 
385 390 395 400 

Ser His Lys Thr Arg Asn Gin Val Leu Pro Ala Lys Leu 
405 410 



(2) INFORMATION FOR SEQ ID NO: 14: 

(ij SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1549 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



<ix) FEATURE: 

(A) NAME/ KEY: CDS 

(BJ LOCATION: 1 . . 1278 



(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



GGCACGAGCT TCGCTGTCCG CGCCCAGTGA CGCGCGTGCG GACCCGAGCC CCAATCTGCA 
CCCCGCAGAC TCGCCCCCGC CCCATACCGG CGTTGCAGTC ACCGCCCGTT GCGCGCCACC 
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CCCA 



ATG CCC GCG GOT CGC CCG GGC CCC GTC. GCC CAA TCC GCG CGG CGG CCG 
Met Pro Ala Gly Arg Pro Gly Pro Val Ala Gin Ser Ala Arg Arg Pro 
15 10 15 

CCG CGG CCG CTG TCC TCG CTG TGG TCG CCT CTG TTG CTC TGT GTC CTC 
Pro Arg Pro Leu Ser Ser Leu Trp Ser Pro Leu Leu Leu Cys Val Leu 
20 25 30 

GGG GTG CCT CGG GGC GGA TCG GGA GCC CAC ACA GCT GTA ATC AGC CCC 
Gly Val Pro Arg Gly Gly Ser Gly Ala His Thr Ala Val lie Sar Pro 
35 40 45 

CAG GAC CCC ACC CTT CTC ATC GGC TCC TCC CTG CAA GCT ACC TGC TCT 
Gin Asp Pro Thr Leu Leu lie Gly Ser Ser Leu Gin Ala Thr Cys Ser 
50 55 60 

ATA CAT GGA GAC ACA CCT GGG GCC ACC GCT GAG GGG CTC TAG TGG ACC 
He Hie Gly Asp Thr Pro Gly Ala Thr Ala Glu Gly Leu Tyr Trp Thr 
65 70 75 80 

CTC AAT GGT CGC CGC CTG CCC TCT GAG CTG TCC CGC CTC CTT AAC ACC 
Leu Asn Gly Arg Arg Leu Pro Ser Glu Leu Ser Arg Leu Leu Asn Thr 
85 90 95 

TCC ACC CTG GCC CTG GCC CTG GCT AAC CTT AAT GGG TCC AGG CAG CAG 
Ser Thr Leu Ala Leu Ala Leu Ala Asn Leu Asn Gly Ser Arg Gin Gin 
100 105 110 

TCA GGA GAC AAT CTG GTG TGT CAC GCC CGA GAC GGC AGC ATT CTG GCT 
Ser Gly Asp Asn Leu Val Cys His Ala Arg Asp Gly Ser He Leu Ala 
115 120 125 

GGC TCC TGC CTC TAT GTT GGC TTG CCC CCT GAG AAG CCC TTT AAC ATC 
Gly Ser Cys Leu Tyr Val Gly Leu Pro Pro Glu Lys Pro Phe Asn He 
130 135 140 

AGC TGC TGG TCC CGG AAC ATG AAG GAT CTC ACG TGC CGC TGG ACA CCG 
Ser Cys Trp Ser Arg Abh Met Lys Asp Leu Thr Cys Arg Trp Thr Pro 
145 150 155 160 

GGT GCA CAC GGG GAG ACA TTC TTA CAT ACC AAC TAC TCC CTC AAG TAC 
Gly Ala His Gly Glu Thr Phe Leu His Thr Asn Tyr Ser Leu Lys Tyr 
165 170 175 

AAG CTG AGG TGG TAC GGT CAG GAT AAC ACA TGT GAG GAG TAC CAC ACT 
Lys Leu Arg Trp Tyr Gly Gin Asp Asn Thr Cys Glu Glu Tyr His Thr 
180 185 190 

GTG GGC CCT CAC TCA TGC CAT ATC CCC AAG GAC CTG GCC CTC TTC ACT 
Val Gly Pro His Ser Cys His He Pro Lys Asp Leu Ala Leu Phe Thr 
195 200 205 

CCC TAT GAG ATC TGG GTG GAA GCC ACC AAT CGC CTA GGC TCA GCA AGA 
Pro Tyr Glu He Trp Val Glu Ala Thr Asn Arg Leu Gly Ser Ala Arg 
210 215 220 

TCT GAT GTC CTC ACA CTG GAT GTC CTG GAC GTG GTG ACC ACG GAC CCC 
Ser Asp Val Leu Thr Leu Asp Val Leu Asp Val Val Thr Thr Asp Pro 
225 230 235 240 

CCA CCC GAC GTG CAC GTG AGC CGC GTT GGG GGC CTG GAG GAC CAG CTG 
Pro Pro Asp Val His Val Ser Arg Val Gly Gly Leu Glu Asp Gin Leu 
245 250 255 

AGT GTG CGC TGG GTC TCA CCA CCA GCT CTC AAG GAT TTC CTC TTC CAA 
Ser Val Arg Trp Val Ser Pro Pro Ala Leu Lys Asp Phe Leu Phe Gin 
260 265 270 

GCC AAG TAC CAG ATC CGC TAC CGC GTG GAG GAC AGC GTG GAC TGC AAG 
Ala Lys Tyr Gin He Arg Tyr Arg Val Glu Asp Ser Val Asp Trp Lys 
27S 280 285 
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GTG GTG GAT CAC GTC AGC AAC CAG ACC TCC TGC CGT CTC GCG GGC CTG 912 
Val Val Asp Asp Val Ser Asn Gin Thr Ser Cys Arg Leu Ala Gly Leu 
290 295 300 

AAG CCC GGC ACC GTT TAC TTC GTC CAA GTG CGT TGT AAC CCA TTC GGG 960 
Lys Pro Gly Thr Val Tyr Phe Val Gin Val Arg Cys Asn Pro Phe Gly 
305 310 315 320 

ATC TAT GGG TCG AAA AAG GCG GGA ATC TGG AGC GAG TGG AGC CAC CCC 1008 
lie Tyr Gly Ser Lys Lys Ala Gly lie Trp Ser Glu Trp Ser His Pro 
325 330 335 

ACC GCT GCC TCC ACC CCT CGA AGT GAG CGC CCG GGC CCG GGC GGC GGG 1056 
Thr Ala Ala Ser Thr Pro Arg Ser Glu Arg Pro Gly Pro Gly Gly Gly 
340 345 - 350 

GTG TGC GAG CCG CGG GGC GGC GAG CCC AGC TCG GGC CCG GTG CGG CGC 1104 
Val Cys Glu Pro Arg Gly Gly Glu Pro Ser Ser Gly Pro Val Arg Arg 
355 360 365 

GAG CTC AAG CAG TTC CTC GGC TGG CTC AAG AAG CAC GCA TAC TGC TCG 1152 
Glu Leu Lys Gin Phe Leu Gly Trp Leu Lys Lye His Ala Tyr Cys Ser 
370 37S 380 

AAC CTT AGT TTC CGC CTG TAC GAC CAG TGG CGT GCT TGG ATG CAG AAG 1200 
Asn Leu Ser Phe Arg Leu Tyr Asp Gin Trp Arg Ala Trp Met Gin Lys 
385 390 395 400 

TCA CAC AAG ACC CGA AAC CAG GAC GAG GGG ATC CTG CCT TCG GGC AGA 1248 
Ser His Lys Thr Arg Asn Gin Asp Glu Gly lie Leu Pro Ser Gly Arg 
405 410 415 

CGG GGT GCG GCG AGA GGT CCT GCC GGT TAAACTCTAA GG AT AGGCCA 1295 
Arg Gly Ala Ala Arg Gly Pro Ala Gly 
420 425 

TCCTCCTGCT GGGTCAGACC TGGAGGCTCA CCTGAATTGG AGCCCCTCTG TACCATCTGG 1355 

GCAACAAAGA AACCTACCAG AGGCTGGGGC ACAATGAGCT CCCACAACCA CAGCTTTGGT 1415 

C CAC ATG ATG GTCACACTTG GATATACCCC AGTGTGGGTA AGGTTGGGGT ATTGCAGGGC 1475 

CTCCCAACAA TCTCTTTAAA TAAATAAAGG AGTTGTTCAG GTAAAAAAAA AAAAAAAAAA 1535 

AAAAAAAAAA AAAA 1549 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Pro Ala Gly Arg Pro Gly Pro Val Ala Gin Ser Ala Arg Arg Pro 
15 10 15 

Pro Arg Pro Leu Ser Ser Leu Trp Ser Pro Leu Leu Leu Cys Val Leu 
20 25 30 

Gly Val Pro Arg Gly Gly Ser Gly Ala His Thr Ala Val He Ser Pro 
35 40 45 

Gin ASD Pro Thr Leu Leu lis Gly Ser Ser Leu Gin Ala Thr Cys Scr 
SO 55 60 

He His Gly Asp Thr Pro Gly Ala Thr Ala Glu Gly Leu Tyr Trp Thr 
65 70 75 80 
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Leu Asn Gly Arg Arg Leu Pro Ser Glu Leu Ser Arg Leu Leu Asn Thr 
85 90 95 

Ser Thr Leu Ala Leu Ala Leu Ala Asn Leu Asn Gly Ser Arg Gin Gin 
100 105 no 

Ser Gly Asp Asn Leu val cys His Ala Arg Asp Gly Ser lie Leu Ala 
115 120 125 

Gly Ser Cys Leu Tyr Val Gly Leu Pro Pro Glu Lys Pro Phe Asn lie 
130 . 135 140 

Ser Cys Trp Ser Arg Asn Met Lya Asp Leu Thr Cys Arg Trp Thr Pro 
145 150 155 160 

Gly Ala His Gly Glu Thr Phe Leu His Thr Asn Tyr Ser Leu Lys Tyr 
165 170 175 

Lys Leu Arg Trp Tyr Gly Gin Asp Asn Thr Cys Glu Glu Tyr His Thr 
ISO 185 190 

Val Gly Pro His Ser Cys His He Pro Lys Asp Leu Ala Leu Phe Thr 
195 200 205 

Pro Tyr Glu He Trp Val Glu Ala Thr Asn Arg Leu Gly Ser Ala Arg 
210 215 220 

Ser Asp Val Leu Thr Leu Asp Val Leu Asp Val Val Thr Thr Asp Pro 
225 230 235 240 

Pro Pro Asp Val His Val Ser Arg Val Gly Gly Leu Glu Asp Gin Leu 
245 250 255 

Ser val Arg Trp Val Ser Pro Pro Ala Leu Lys Asp Phe Leu Phe Gin 
260 265 270 

Ala Lys Tyr Gin He Arg Tyr Arg Val Glu Asp Ser Val Asp Trp Lys 
275 280 285 

Val Val Asp Asp Val Ser Aen Gin Thr Ser Cys Arg Leu Ala Gly Leu 
290 295 300 

Lys Pro Gly Thr Val Tyr Phe Val Gin Val Arg Cys Asn Pro Phe Gly 
305 310 315 320 

He Tyr Gly Ser Lys Lys Ala Gly He Trp Ser Glu Trp Ser His Pro 
325 330 335 

Thr Ala Ala Ser Thr Pro Arg Ser Glu Arg Pro Gly Pro Gly Gly Gly 
340 345 350 

Val Cys Glu Pro Arg Gly Gly Glu Pro Ser Ser Gly Pro Val Arg Arg 
355 360 365 

Glu Leu Lys Gin Phe Leu Gly Trp Leu Lys Lys His Ala Tyr Cys Ser 
370 375 380 

Asn Leu Ser Phe Arg Leu Tyr Aep Gin Trp Arg Ala Trp Met Gin Lys 
385 390 395 400 

Ser His Lys Thr Arg Asn Gin Asp Glu Gly He Leu Pro Ser Gly Arg 
405 410 415 

Arg Gly Ala Ala Arg Gly Pro Ala Gly 
420 425 



(-2) INFORMATION FOR SEQ ID NO: 16; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 8 base pairs 

(B) TYPE: nucleic acid 
tC) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE; DNA 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION : 1..468 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GGC ACC GTT TAC TTC GTC CAA GTG CGT TGT AAC CCA TTC GGG ATC TAT 48 
X5ly Thr Val Tyr Phe Val Gin Val Arg Cys Asn Pro Phe Gly lie Tyr 
15 10 15 



GGG TCG AAA AAG GCG GGA ATC TGG AGC GAG TGG AGO CAC CCC ACC GCT 
Gly Ser Lys Lys Ala Gly lie Trp Ser Glu Trp Ser His Pro Thr Ala 
20 25 30 



96 



Q 
tfi 



GCC TCC ACC CCT CGA AGT GAG CGC CCG GGC CCG GGC GGC GGG GTG TGC 144 
Ala Ser Thr Pro Arg Ser Glu Arg Pro Gly Pro Gly Gly Gly Val Cys 
35 40 45 

GAG CCG CGG GGC GGC GAG CCC AGC TCG GGC CCG GTG CGG CGC GAG CTC 192 
Glu Pro Arg Gly Gly Glu Pro Ser Ser Gly Pro Val Arg Arg Glu Leu 
50 55 60 



AAG CAG TTC CTC GGC TGG CTC AAG AAG CAC GCA TAC TGC TCG AAC CTT 
Lys Gin Phe Leu Gly Trp Leu Lys Lys His Ala Tyr Cys Ser Asn Leu 
65 70 75 80 



240 



AGT TTC CGC CTG TAC GAC CAG TGG CGT GCT TGG ATG CAG AAG TCA CAC 
Ser Phe Arg Leu Tyr Asp Gin Trp Arg Ala Trp Met Gin Lys Ser His 
85 90 95 



288 



hi 



AAG ACC CGA AAC CAG GTA GGA AAG TTG GGG GAG GCT TGC GTG GGG GGT 
Lys Thr Arg Asn Gin Val Gly Lys Leu Gly Glu Ala Cys val Gly Gly 
100 105 110 



336 



AAA GGA GCA GAG GAA GAG AGA GAC CCG GGT GAG CAG CCT CCA CAA CAC 
Lys Gly Ala Glu Glu Glu Arg Asp Pro Gly Glu Gin Pro Pro Gin His 
115 120 125 



334 



CGC ACT CTT CTT TCC AAG CAC AGG ACG AGG GGA TCC TGC CCT CGG GCA 
Arg Thr Leu Leu Ser Lys His Arg Thr Arg Gly Ser Cys Pro Arg Ala 
130 135 140 



432 



GAC GGG GTG CGG CGA GAG GTA AGG GGG TCT GGG TGAGTGGGGC CTACAGCAGT 485 
Asp Gly Val Arg Arg Glu Val Arg Gly Ser Gly 
145 ISO 155 



CTAGATGAGG 


CCCTTTCCCC 


TCCTTCGGTG 


TTGCTCAAAG 


GGATCTCTTA 


GTGCTCATTT 


S45 


CACCCACTGC 


AAAGAGCCCC 


AGGTTTTACT 


GCATCATCAA 


GTTGCTGAAG 


GGTCCAGGCT 


605 


TAATGTGGCC 


TCTTTTCTGC 


CCTCAGGTCC 


TGCCGGCTAA ACTCTAAGGA 


TAGGCCATCC 


665 


TCCTGCTGGG 


TCAGACCTGG 


AGGCTCACCT 


GAATTGGAGC 


CCCTCTGTAC 


CTATCTGGGC 


725 


AACAAAGAAA 


CCTACCATGA 


GGCTGGGGCA 


CAATGAGCTC 


CCACAACCAC 


AGCTTTGGTC 


785 


CACATGATGG 


TCACACTTGG 


ATATACCCCA 


GTGTGGGTAA 


GGTTGGGGTA 


TTGCAGGGCC 


845 


TCCCAACAAT 


CTCTTTAAAT 


AAATAAAGGA 


GTTGTTCAGG 


TAAAAAAAAA 


AAAAAAAAAA 


905 


AAAAAAAAAA 


AAAAAAAAAA 


AAAAAAAAAA 


AAA 






938 



(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 

j (A) LENGTH: 155 amino acids 
(B) tfYPE: amino acid 
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(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

Gly Thr Val Tyr Phe Val Gin Val Arg Cys Asn Pro Phe Gly lie Tyr 
1 5 10 15 

Gly Ser Lys Lys Ala Gly He Trp Ser Glu Trp Ser His Pro Thr Ala 
20 25 30 

Ala Ser Thr Pro Arg Ser Glu Arg Pro Gly Pro Gly Gly Gly Val Cys 
35 ^O 45 

Glu Pro Arg Gly Gly Glu Pro Ser Ser Gly Pro Val Arg Arg Glu Leu 
50 55 60 

Lys Gin Phe Leu Gly Trp Leu Lys Lys His Ala Tyr Cys Ser Asn Leu 
65 70 75 80 

Ser Phe Arg Leu Tyr Asp Gin Trp Arg Ala Trp Mec Gin Lys Ser His 
85 90 95 

Lys Thr Arg Asn Gin Val Gly Lys Leu Gly Glu Ala Cys Val Gly Gly 
100 105 110 

Lys Gly Ala Glu Glu Glu Arg Asp Pro Gly Glu Gin Pro Pro Gin His 
115 120 125 

Arg Thr Leu Leu Ser Lys His Arg Thr Arg Gly Ser Cys Pro Arg Ala 
130 135 140 

Asp Gly Val Arg Arg Glu Val Arg Gly Ser Gly 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 base pairs 

(B) TYPE: nucleic acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1. .834 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: IB: 

CCC ACC CTT CTC ATC GGC TCC TCC CTG CAA GCT ACC TGC TCT ATA CAT 
Pro Thr Leu Leu lie Gly Ser Ser Leu Gin Ala Thr Cys Ser lie His 
51 55 60 65 

GGA GAC ACA CCT GGG GCC ACC GCT GAG GGG CTC TAC TGG ACC CTC AAT 
Gly Asp Thr Pro Gly Ala Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn 
70 75 80 

GGT CGC CGC CTG CCC TCT GAG CTG TCC CGC CTC CTT AAC ACC TCC ACC 
Gly Arg Arg Leu Pro Ser Glu Leu Ser Arg Leu Leu Asn Thr Ser Thr 
85 90 95 

CTG GCC CTG GCC CTG GCT AAC CTT AAT GGG TCC AGG CAG CAG TCA GGA 

Leu Ala Leu Ala Leu Ala Asn Leu Asn Gly Ser Arg Gin Gin Ser Gly 
100 105 110 



GAC AAT CTG GTG TGT CAC GCC CGA GAC GGC AGC ATT CTG GCT GGC TCC 
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I : : 



Asp Asn Leu Val Cys His Ala Arg Asp Gly Ser lie Leu Ala Gly Ser 
115 120 125 130 

TGC CTC TAT GTT GGC TTG CCC CCT GAG AAG CCC TTT AAC ATC AGC TGC 
Cvs Leu Tyr Val Gly Leu Pro Pro Glu Lys Pro Phe Asn He Ser Cys 
135 140 



145 



TGG TCC CGG AAC ATG AAG GAT CTC ACG TGC CGC TGG ACA CCG GGT GCA 
Trp ser Arg Asn Met Lys Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala 
150 155 200 

CAC GGG GAG ACA TTC TTA CAT ACC AAC TAC TCC CTC AAG TAC AAG CTG 
His Gly Glu Thr Phe Leu His Thr Aen Tyr Ser Leu Lys Tyr Lys Leu 
205 210 215 

AGG TGG TAC GGT CAG GAT AAC ACA TGT GAG GAG TAC CAC ACT GTG GGG 
Arg Trp Tyr Gly Gin Asp Asn Thr Cys Glu Glu Tyr His Thr val Gly 
220 225 230 

CCC CAC TCA TGC CAT ATC CCC AAG GAC CTG GCC CTC TTC ACT CCC TAT 
Pro His ser Cys His He Pro Lys Asp Leu Ala Leu Phe Thr Pro Tyr 
235 240 245 250 

GAG ATC TGG GTG GAA GCC ACC AAT CGC CTA GGC TCA GCA AGA TCT GAT 
Glu He Trp Val Glu Ala Thr Asn Arg Leu Gly Ser Ala Arg Ser Asp 
255 260 265 

GTC CTC ACA CTG GAT GTC CTG GAC GTG GTG ACC ACG GAC CCC CCA CCC 
val Leu Thr Leu Asp Val Leu Asp Val Val Thr Thr Asp Pro Pro Pro 
270 275 280 

GAC GTG CAC GTG AGC CGC GTT GGG GGC CTG GAG GAC CAG CTG AGT GTG 
Asp val His Val Ser Arg Val Gly Gly Leu Glu Asp Gin Leu Ser Val 
265 290 295 

CGC TGG GTC TCA CCA CCA GCT CTC AAG GAT TTC CTC TTC CAA GCC AAG 
Arg Trp Val Ser Pro Pro Ala Leu Lys Asp Phe Leu Phe Gin Ala Lys 
300 305 310 

TAC CAG ATC CGC TAC CGC GTG GAG GAC AGC GTG GAC TGG AAG GTG GTG 
Tyr Gin He Arg Tyr Arg val Glu Asp Ser Val Asp Trp Lys Val Val 
315 320 325 330 

GAT GAC GTC AGC AAC CAG ACC TCC TGC CGT CTC GCG GGC CTG AAG CCC 
Asp Asp Val Ser Asn Gin Thr ser Cys Arg Leu Ala Gly Leu Lys Pro 
335 340 345 

GGC ACC GTT TAC TTC GTC CAA GTG CGT TGT AAC CCA TTC GGG ATC TAT 
Gly Thr val Tyr Phe Val Gin Val Arg Cys Asn Pro Phe Gly He Tyr 
350 355 360 

GGG TCG AAA AAG GCG GGA 
Gly Ser Lys Lys Ala Gly 
365 



338 



386 



434 



432 



530 



578 



626 



674 



722 



770 



818 



866 



894 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



Pro Thr Leu Leu He Gly Ser Ser Leu Gin Ala Thr Cys Ser He His 

51 55 60 65 

Gly Asp Thr Pro Gly Ala Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn 

70 . 75 30 



-80- 



Gly Arg Arg Leu Pro Ser Glu Leu Ser Arg Leu Leu Aen Thr Ser Thr 
85 90 95 

LeU Ala Leu Ala Leu Ala Asn Leu Asn Gly Ser Arg Gin Gin Ser Gly 
100 105 HO 

Asp Asn Leu Val Cys His Ala Arg Asp Gly Ser lie Leu Ala Gly Ser 
115 120 125 130 

Cys Leu Tyr Val Gly Leu Pro Pro Glu Lys Pro Phe Asn He Ser Cys 
135 140 145 

Trp Ser Arg Asn Met Lys Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala 
150 155 200 

His Gly Glu Thr Phe Leu His Thr Asn Tyr Ser Leu Lya Tyr Lys Leu 
205 210 215 

Arg Trp Tyr Gly Gin Asp Asn Thr Cys Glu Glu Tyr His Thr val Gly 
220 225 230 

Pro His Ser Cys His He Pro Lys Asp Leu Ala Leu Phe Thr Pro Tyr 
235 240 245 250 

Glu He Trp Val Glu Ala Thr Asn Arg Leu Gly Ser Ala Arg Ser Asp 
255 260 265 

Val Leu Thr Leu Asp Val Leu Asp Val Val Thr Thr Asp Pro Pro Pro 
270 275 280 

Asp Val His Val Ser Arg Val Gly Gly Leu Glu Asp Gin Leu Ser Val 
285 290 295 

Arg Trp Val Ser Pro Pro Ala Leu Lys Asp Phe Leu Phe Gin Ala Lys 
300 305 310 

Tyr Gin He Arg Tyr Arg Val Glu Asp Ser Val Asp Trp Lys Val Val 
315 320 325 330 

Asp Asp Val Ser Asn Gin Thr Ser Cys Arg Leu Ala Gly Leu Lys Pro 
335 340 345 



Gly Thr Val Tyr Phe Val Gin Val Arg Cys Asn Pro Phe Gly lie Tyr 
350 355 360 

Gly Ser Lys Lys Ala Gly 
365 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 base pairs 

(B) TYPE: nucleic acids 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



GGCATGAAGG CTTAGGGTGG GGATCGGTAG GACCCATGCA CCCAGAGAAA GGGACTGGTG 

GCAACTTTCA AACTCTCTGG GGAAGGAAGA AGGGCTGAAA GAGG 

ATG AAC GGG CTC AGA CAC AGC TGT AAT CAG CCC CCA GGA 
Met Asn Gly Leu Arg His ser Cys Asn Gin Pro Pro Gly 
5 10 



(2) INFORMATION FOR SEQ ID NO: 21: 



(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 
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(B) TYPE: amino acids 
(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Met Aen Gly Leu Arg His Ser Cys Asn Gin Pro Pro Gly 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1930 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GGCACGAGCT TCGCTGTCCG CGCCCAGTGA CGCGCGTGCG GACCCGAGCC CCAATCTGCA 60 

CCCCGCAGAC TCGCCCCCGC CCCATACCGG CGTTGCAGTC ACCGCCCGTT GCGCGCCACC 120 

CCCAATGCCC GCGGGTCGCC CGGGCCCCGT CGCCCAATCC GCGCGGCGGC CGCCGCGGCC 180 

GCTGTCCTCG CTGTGGTCGC CTCTGTTGCT CTGTGTCCTC GGGGTGCCTC GGGGCGGATC 240 

GGGAGCCCAC ACAGCTGTAA TCAGCCCCCA GGACCCCACC CTTCTCATCG GCTCCTCCCT 300 

GCAAGCTACC TGCTCTATAC ATGGAGACAC ACCTGGGGCC ACCGCTGAGG GGCTCTACTG 360 

GACCCTCAAT GGTCGCCGCC TGCCCTCTGA GCTGTCCCGC CTCCTTAACA CCTCCACCCT 420 

GGCCCTGGCC CTGGCTAACC TTAATGGGTC CAGGCAGCAG TCAGGAGACA ATCTGGTGTG 480 

TCACGCCCGA GACGGCAGCA TTCTGGCTGG CTCCTGCCTC TATGTTGGCT TGCCCCCTGA 540 

GAAGCCCTTT AACATCAGCT GCTGGTCCCG GAACATGAAG GATCTCACGT GCCGCTGGAC 600 

ACCGGGTGCA CACGGGGAGA CATTCTTACA TACCAACTAC TCCCTCAAGT ACAAGCTGAG 660 

GTGGTACGGT CAGGATAACA CATGTGAGGA GTACCACACT GTGGGCCCTC ACTCATGCCA 720 

TATCCCCAAG GACCTGGCCC TCTTCACTCC CTATGAGATC TGGGTGGAAG CCACCAATCG 780 

CCTAGGCTCA GCAAGATCTG ATGTCCTCAC ACTGGATGTC CTGGACCTGG TGACCACGGA 840 

CCCCCCACCC GACGTGCACG TGAGCCGCGT TGGGGGCCTG GAGGACCAGC TGAGTGTGCG 900 

CTGGGTCTCA CCACCAGCTC TCAAGGATTT CCTCTTCCAA GCCAAGTACC AGATCCGCTA 960 

CCGCGTGGAG GACAGCGTGG ACTGGAAGGT GGTGGATGAC GTCAGCAACC AGACCTCCTG 1020 

CCGTCTCGCG GGCCTGAAGC CCGGCACCGT TTACTTCGTC CAAGTGCGTT GTAACCCATT 1080 

CGGGATCTAT GGCTCGAAAA AGGCGGGAAT CTGGAGCGAG TGGAGCCACC CCACCGCTOC 1140 

CTCCACCCCT CGAAGTGAGC GCCCGGGCCC GGGCGGCGGG GTGTGCGAGC CGCGGGGCGG 1200 

CGAGCCCAGC TCGGGCCCGG TGCGGCGCGA GCTCAAGCAG TTCCTCGGCT GGCTCAAGAA 1260 

GCACGCATAC TGCTCGAACC TTAGTTTCCG CCTGTACGAC CAGTGGCGTG CTTGGATGCA 1320 

GAAGTCACAC AAGACCCGAA ACCAGGTAGG AAAGTTGGGG GAGGCTTGCG TGGGGGGTAA 13 80 

AGGAGCAGAG GAAGAGAGAG ACCCGGGTGA GCAGCCTCCA CAACACCGCA CTCTTCTTTC 1440 
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CAAGCACAGG acgaggggat cctgccctcg ggcagacggg GTGCGGCGAG AGGTAAGGGG 1500 

GTCTGGGTGA GTGGGGCCTA CAGCAGTCTA GATGAGGCCC TTTCCCCTCC TTCGGTGTTG 1560 

CTCAAAGGGA TCTCTTAGTG CTCATTTCAC CCACTGCAAA GAGCCCCAGG TTTTACTGCA 1620 

TCATCAAGTT GCTGAAGGGT CCAGGCTTAA TGTGGCCTCT TTTCTGCCCT CAGGTCCTGC 1660 

CGGCTAAACT CTAAGGATAG GCCATCCTCC TGCTGGGTCA GACCTGGAGG CTCACCTGAA 1740 

TTGGAGCCCC TCTGTACCTA TCTGGGCAAC AAAGAAACCT ACCATGAGGC TGGGGCACAA 1300 

TGAGCTCCCA CAACCACAGC TTTGGTCCAC ATGATGGTCA CACTTGGATA TACCCCAGTG 1860 

TGGGTAAGGT TGGGGTATTG CAGGGCCTCC CAACAATCTC TTTAAATAAA TAAAGGAGTT 1920 

GTTCAGGTAA 1930 

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 baae pairs 
Pi (B) TYPE: nucleic acid 

<C) STRANDEDNESS: single 
^ (D) TOPOLOGY: linear 

J?j (ii) MOLECULE TYPE: DNA 

^ <xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

S 1 TCCAGGCAGC GGTCGGGGGA CAACCTCGTG TGCCACGCCC GTGACGGCAG CATCCTGGCT 60 

1- GGCTCCTGCC TCTATGTTGG CCTGCCCCCA GAGAAACCCG TCAACATCAG CTGCTGGTCC 120 

y MGAACATGA AGGACTTGAC CTGCCGCTGG ACGCCAGGGG CCCACGGGGA GACCTTCCTC 180 

CACACCAACT ACTCCCTCAA GTACAAGCTT AGGTGGTATG GCCAGGACAA CACATGTGAG 240 

GAGTACCACA CAGTGGGGCC CCACTCCTGC CACATCCCCA AGGACCTGGC TCTCTTTACG 300 

CCCTATGAGA TCTGGGTGGA GGCCACCAAC CGCCTGGGCT CTGCCCGCTC CGATGTACTC 360 

ACGCTGGATA TCCTGGATGT GGTGACCACG GACCCCCCGC CCGACGTGCA CGTGAGCCGC 420 

GTCGGGGGCC TGGAGGACCA GCTGAGCGTG CGCTGGGTGT CGCCACCCGC CCTCAAGGAT 480 

TTCCTTTTTC AAGCCAAATA CCAGATCCGC TACCGAGTGG AGGACAGTGT GGAATGGAAG 540 

GTGGTGGACG ATGTGAGCAA 5 g 0 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1391 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ixj FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1. .1053 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
ACC CTC AAC GGG CGC CGC CTG CCC CCT GAG CTC TCC CGT GTA CTC AAC 48 

Thr Leu Asn Cly Arg Arg Leu Pro Pro Glu Leu Ser Arg Val Leu Asn 
15 io is 

GCC TCC ACC TTG GCT CTG GCC CTG GCC AAC CTC AAT GGG TCC AGG CAG 96 
Ala Ser Thr Leu Ala Leu Ala Leu Ala Asn Leu Asn Gly Ser Arg Gin 
20 25 30 

CGG TCG GGG GAC AAC CTC GTG TGC CAC GCC CGT GAC GGC AGC ATC CTG 144 
Arg Ser Gly Asp Asn Leu Val Cys His Ala Arg Asp Gly Ser lie Leu 
35 40 45 

GCT GGC TCC TGC CTC TAT GTT GGC CTG CCC CCA GAG AAA CCC GTC AAC 192 
Ala Gly Ser Cys Leu Tyr Val Gly Leu Pro Pro Glu Lys Pro val Asn 
50 55 60 

ATC AGC TGC TGG TCC AAG AAC ATG AAG GAC TTG ACC TGC CGC TGG ACG 240 
He Ser Cys Trp Ser Lys Asn Met Lys Asp Leu Thr Cya Arg Trp Thr 
65 70 75 80 

CCA GGG GCC CAC GGG GAG ACC TTC CTC CAC ACC AAC TAC TCC CTC AAG 288 
Pro Gly Ala His Gly Glu Thr Phe Leu His Thr Asn Tyr Ser Leu Lys 
85 90 95 

TAC AAG CTT AGG TGG TAT GGC CAG GAC AAC ACA TGT GAG GAG TAC CAC 336 
Tyr Lys Leu Arg Trp Tyr Gly Gin Asp Asn Thr Cys Glu Glu Tyr His 
100 105 110 

ACA GTG GGG CCC CAC TCC TGC CAC ATC CCC AAG GAC CTG GCT CTC TTT 384 
Thr val Gly Pro His Ser Cys His He Pro Lys Asp Leu Ala Leu Phe 
115 120 125 

ACG CCC TAT GAG ATC TGG GTG GAG GCC ACC AAC CGC CTG GGC TCT GCC 432 
Thr Pro Tyr Glu He Trp Val Glu Ala Thr Asn Arg Leu Gly Ser Ala 
130 135 140 

CGC TCC GAT GTA CTC ACG CTG GAT ATC CTG GAT GTG GTG ACC ACG GAC 480 
Arg Ser Asp Val Leu Thr Leu Asp He Leu Asp Val val Thr Thr Asp 
145 150 155 160 

CCC CCG CCC GAC GTG CAC GTG AGC CGC GTC GGG GGC CTG GAG GAC CAG 528 
Pro Pro Pro Asp Val His Val Ser Arg Val Gly Gly Leu Glu Asp Gin 
165 170 175 

CTG AGC GTG CGC TGG GTG TCG CCA CCC GCC CTC AAG GAT TTC CTC TTT 576 
Leu Ser Val Arg Trp Val Ser Pro Pro Ala Leu Lys Asp Phe Leu Phe 
180 185 190 

CAA GCC AAA TAC CAG ATC CGC TAC CGA GTG GAG GAC AGT GTG GAC TGG 624 
Gin Ala Lys Tyr Gin He Arg Tyr Arg Val Glu Asp Ser Val Aep Trp 
195 200 205 

AAG GTG GTG GAC GAT GTG AGC AAC CAG ACC TCC TGC CGC CTG GCC GGC 672 
Lys Val Val Asp Asp val Ser Asn Gin Thr Ser Cys Arg Leu Ala Gly 
210 215 220 

CTG AAA CCC GGC ACC GTG TAC TTC GTG CAA GTG CGC TGC AAC CCC TTT 720 
Leu Lys Pro Gly Thr Val Tyr Phe Val Gin Val Arg Cys Asn Pro Phe 
225 230 235 240 

GGC ATC TAT GGC TCC AAG AAA GCC GGG ATC TGG AGT GAG TGG AGC CAC 763 
Gly He Tyr Gly ser Lys Lys Ala Gly He Trp Ser Glu Trp Ser His 
245 250 255 

CCC ACA GCC GCC TCC ACT CCC CGC AGT GAG CGC CCG GGC CCG GGC GGC 816 
Pro Thr Ala Ala Ser Thr Pro Arg Ser Glu Arg Pro Gly Pro Gly Gly 
260 265 270 

GGG GCG TGC GAA CCG CGG GGC GGA GAG CCG AGC TCG GGG CCG GTG CGG 364 
Gly Ala Cys Glu Pro Arg Gly Gly Glu Pro Ser Ser Gly Pro Val Arg 
275 280 285 
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CGC GAG CTC AAG CAG TTC CTG GGC TGG CTC AAG AAG CAC GCG TAC TGC 912 
Arg Glu Leu Lys Gin Phe Leu Gly Trp Leu Lys Lys His Ala Tyr Cys 
290 295 300 

TCC AAC CTC AGC TTC CGC CTC TAC GAC CAG TGG CGA GCC TGG ATG CAG 960 
Ser Asn Leu Ser Phe Arg Leu Tyr Asp Gin Trp Arg Ala Trp Met Gin 
305 310 315 320 

AAG TCG CAC AAG ACC CGC AAC CAG CAC AGG ACG AGG GGA TCC TGC CCT 1008 
Lya Ser His Lys Thr Arg Asn Gin His Arg Thr Arg Gly Ser Cys Pro 
325 330 335 

CGG GCA GAC GGG GCA CGG CGA GAG GTC CTG CCA GAT AAG CTG TAGGGGCTCA 1060 
Arg Ala Asp Gly Ala Arg Arg Glu Val Leu Pro Asp Lys Leu 
340 345 350 

GGCCACCCTC CCTGCCACGT GGAGACGCAG AGGCCGAACC CAAACTGGGG CCACCTCTGT 1120 

ACCCTCACTT CAGGGCACCT GAGCCCCTCA GCAGGAGCTG GGGTGGCCCC TGAGCTCCAA 1180 

CGGCCATAAC AGCTCTGACT CCCACGTGAG GCCACCTTTG GGTGCACCCC AGTGGGTGTG 1240 

TGTGTGTGTG TGAGGGTTGG TTGAGTTGCC TAGAACCCCT GCCAGGGCTG GGGGTGAGAA 1300 

p GGGGAGTCAT TACTCCCCAT TACCTAGGGC CCCTCCAAAA GAGTCCTTTT AAATAAATGA 1360 

™ GCTATTTAGG TGCAAAAAAA AAAAAAAAAA A 1391 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 
in (A) LENGTH: 350 amino acids 

Tl (B) TYPE: amino acid 

S (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

UJ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

LT Thr Leu Asn Gly Arg Arg Leu Pro Pro Glu Leu Ser Arg Val Leu Asn 

Q 1 5 10 15 

« Ala Ser Thr Leu Ala Leu Ala Leu Ala Asn Leu Asn Gly Ser Arg Gin 

20 25 30 

Arg Ser Gly Asp Asn Leu Val Cys Hi9 Ala Arg Aep Gly Ser lie Leu 
35 40 45 

Ala Gly Ser Cys Leu Tyr Val Gly Leu Pro Pro Glu Lys Pro Val Asn 
50 55 60 

He Ser Cys Trp Ser Lye Asn Met Lys Asp Leu Thr Cys Arg Trp Thr 
65 70 75 80 

Pro Gly Ala His Gly Glu Thr Phe Leu His Thr Asn Tyr Ser Leu Lys 
85 90 95 

Tyr Lys Leu Arg Trp Tyr Gly Gin Asp Asn Thr Cys Glu Glu Tyr His 
100 105 110 

Thr val Gly Pro His Ser Cys His He Pro Lys Asp Leu Ala Leu Phe 
115 120 125 

Thr Pro Tyr Glu He Trp Val Glu Ala Thr Asn Arg Leu Gly Ser Ala 
130 135 140 

Arg Ser Asp Val Leu Thr Leu Asp He Leu Asp Val Val Thr Thr Aep 
145 150 155 160 

Pro Pro Pro Asp Val His Val Ser Arg val Gly Gly Leu Glu Asp Gin 
165 170 175 ' 

Leu Ser Val Arg Trp Val Ser Pro Pro Ala Leu Lys Asp Phe Leu Phe 
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180 



185 



190 



Gin Ala Lys Tyr Gin He Arg Tyr Arg Val Glu Asp Ser Val Asp Trp 
195 200 205 

Lys Val val Asp Asp Val Ser Asn Gin Thr Ser Cys Arg Leu Ala Gly 
210 215 220 

Leu Lys Pro Gly Thr Val Tyr Phe Val Gin Val Arg Cys Asn Pro Phe 



225 



230 



235 



Gly He Tyr Gly Ser Lys Lys Ala Gly lie Trp Ser Glu Trp Ser His 



245 250 
Pro Thr Ala Ala Ser Thr Pro Arg Ser Glu Arg Pro Gly Pro Gly Gly 



260 



265 



Gly Ala Cys Glu Pro Arg Gly Gly Glu Pro Ser Ser Gly Pro Val Arg 
275 280 285 

Arg Glu Leu Lys Gin Phe Leu Gly Trp Leu Lys Lys His Ala Tyr Cys 
290 295 300 

Ser Aen Leu Ser Phe Arg Leu Tyr Asp Gin Trp Arg Ala Trp Met Gin 
305 310 315 320 

Lys Ser His Lys Thr Arg Asn Gin His Arg Thr Arg Gly Ser Cys Pro 
325 330 335 

Arg Ala Asp Gly Ala Arg Arg Glu Val Leu Pro Asp Lys Leu 
340 345 350 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ir) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



TCCAGGCAGC GGTCGGGGGA CAAC 



(2) INFORMATION FOR SEQ ID NO: 27: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
(6) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TTGCTCACAT CGTCCACCAC CTTC 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
-(D) TOPOLOGY: linear 




(ii) MOLECULE TYPE: DNA 



hi 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

CCCAGAACTC TTGGACGCTG AGGCAGGAGG ATTCCCAAGT TTCAAGACAG TGTGTTTCTA 60 

GGTAATGAGA CCCTGTCAAG AAAAGAAAAG AAATAAAGAG ACAAGAAAAT GTTTATAGGC 120 

TGTGAGACAG CTTGGTGGGT AAGGGGCACT TGCCTCCAAT CAAGATGACC TCAGCCCCAT 180 

CCCTAGGAAT CCATGGTAGA AGGAGAAAGC AAACTCGCAG CTGCTGACCT CCATACATGT 240 

GCTCCAATGT GCACACACAC AGGGAGACAT AATCAATTAA TAGGATGTAT TTGCTTAGAT 300 

TTGAGTAGGC ATTTATGACT GATGTTTTAA AATTTTTATT TGATTTTATG AAAATATACC 360 

TGTTTGTATT TGGTTTGGTT TGGTTTGAGT TTTGTTTATT TGAGACAGGG CTTCTCTGTG 420 

TAGTCCTGGC TGTCCTTGGA ACTCACTCTG TAGACCAGGC TGGCCTTGAA CTCAGAAATC 480 

CGCCTGCTTG TGCTTCCCAA GTGCTTAGAT TAAAGGTGTG CACTGCCATT CAGCAAAATT 540 

GCATACTTTA ACCCCAGTAT TTGGGAGGCA GAGGCAGACT AATGTGTGAA TTCCAGGCTA $00 

GCCAAGGATA CAGAGTGAGA CCCTATTCTT ACCCTCCCCC CCCAAAACCC CAAAATGTAT 660 

TTTGTGCTTG TGTATGTACA TGTGTGTTGC AGCACGTAAA TGTCCAAGGA CAACTTGTAG 720 

AAGTTCTCTC CGTTCACAGT CTAAGTCCTG AATTCAAACT AAGGTCCTCA GGCTTAGCCA 780 

CAGTCTTCTT TATGTACTGA GCCATTTCAC TGGCCCTGGA TTGACTGATG AATTAATTTT 840 

TGAGATAAGG TCTCTTGTAG CTCTAGCTAG GCTCAAACTA TGAACTCCCA AGGTCATCTT 900 

GAGCTGCTGG TACTCTTGCT TCCACCCCAA GTGGTGGAAT GATACTCAGG CAGCACTTCT 960 

CTGGGGAAGG GGCTGGCCTT GGCCTTGATT TTGTTGCCTC AGCTTCAATG AGTGCTTGGG 1020 

TCTCGTTGTT TCTTTTCTTT ATCTGTGAAA TGGGTGAACA CCTGTTCAAG ACTTCCTGAC 1080 

TCTTGAAACA TCCAGGCAGG GTGAGGGACT TGAAGTGGGC TCATCCCATG CCTAACAAAG 1140 

TGTCGTCTTT GACCCCAGAC ACAGCTGTAA TCAGCCCCCA GGACCCCACC CTTCTCATCG 1200 

GCTCCTCCCT GCAAGCTACC TGCTCTATAC ATGGAGACAC ACCTGGGGCC ACCGCTGAGG 1260 

GGCTCTACTG GACCTTCAAT GGTCGCCGCC TGCCCTCTGA GCTGTCCCGC CTCCTTAACA 1320 

CCTCCACCCT GGCCCTGGCC CTGGCTAACC TTAATGGGTC CAGGCAGCAG TCAGGAGACA 13 80 

ATCTGGTGTG TCACGCCCGA GACGGCAGCA TTCTGGCTGG CTCCTGCCTC TATGTTGGCT 1440 

GTAAGTGGGG CCCCAGACAC TCAGAGATAG ATGGGGGTTG GCAATGACAG ATTTAGAGCC 1500 

TGGGTCTTCT GTCCTGGGGC AG AG C CAT GG GCTCTCACTT GCATGCAGGC ATGGTCATAC 1560 

CCAGCACAGG CATTGCAACT CTAGGGACAG CTGTGGCTGC ACTGTCCCCT GTGTACCCCA 1620 

CAGCTTTAGA AAAGCTGTCA TGTTTTCCTT GTAGTGCCCC CTGAGAAGCC CTTTAACATC 1680 

AGCTGCTGGT CCCGGAACAT GAAGGATCTC ACGTGCCGCT GGACACCGGG TGCACACGGG 1740 

GAGACATTCT TACATACCAA CTACTCCCTC AAGTACAAGC TGAGGTTGGT ACCCAGCCAA 1800 

GCCTTGCTGT GTGACTTCTG GCAATACTTA CCTTCTCTGA TCAAATATGT TCCTGTTTAT 1860 

GAACTCAAAA GGGACTCTCG CACCTCCACA GGTGGTACGG TCAGGATAAC ACATGTGAGG 1920 

AGTACCACAC TGTGGGCCCT CACTCATGCC ATATCCCCAA GGACCTGGCC CTCTTCACTC 1980 

CCTATGAGAT CTGGGTGGAA GCCACCAATC GCCTAGGCTC AGCAAGATCT GATGTCCTCA 2040 
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CACTGGATGT CCTGGACGTG GGTGAGCCCC CAGTGTCCAC CTGTGTTCTG CCCTAGACCT 2100 

TATAGGGCGC CTCCCCCCCA TCCCCCCAGA CTTTTTGGTT CTTCTAGAGG TCTTAGCCAC 2160 

AGCCACGGTG GTTGCAGGAC AGTGGTTGTT CATAACTTAA TGCAAAGACT TTCCCCCAAG 2220 

ACAGTCAAGA TTTTTCCCCT CCCCACCCCC AACACACACA TACACACACA CTCTGCAGAG 2280 

AACACCTGGC CTGACCACCC TCCCTCTCTA CAGCCCAGGT GTTCAGAAGG GAGTCCTAGG 2340 

GGACTGAGAG GAGGCGCCCA GGTCTGAAGG CGCCCCAGGA AGCCGAGGCC TTGAGCTGGG 2400 

GGGGGGGGCG AGGGTTGGAG GCACGAACTG GATGATCCCT GAGCACAACT GGGCCTAATC 2460 

TAATTAGGGT GTTCCCAGCC CAAAGCAGCC TGGGCCATTT AACCCTTCAA GTGCCTCACT 2520 

GAAGACTCAG GGGAGAGATC AGCTTGTACT CTCTCCATGG TCCCCCAGGA GGGTTCCTGG 2580 

GTGCCCCTGG CTCATTCCCA CATCCAGAGG TTTTGTGTCT TCCTCGCATC TAACCCTCAG 2640 

TTGTGCTCTG TGGCTGGCAC AGCTGCCCCG TGGAGGCTCT TGGTAATGTA CAAGGCATCA 2700 

GAGGTGGACA TGGGATGGGG ATACATAGGG ATGGAGCCAA ATAGCACCTC AAGGTGGGGT 2760 

Jj GATATACAAT AAAGCTTGTC ACCCTGACGC TCAGAAAGCC TACTCATGAT GATCACAATT 2820 

Q GTTGACATCA CTCTGGGACA TGTAGTGAGA CCCTAGCTCA AAACACAGAC AGTAGCTTTA 2880 

S AGAGTCAGCT TGTGACTTAA TACTGGAACT CAGGGCCTAA TAGGTGCTGG GTGATGCTCG 2940 

Cfl CCTCACTCCC TGTTTAGTGA GATCTCTGCG CTAATCTCCA CCCCAGCTGG GTGGGCTGCT 3000 

p] CTGTCCCCTT GAGGGCAGGA ATGTGTGTCT TCCATCAGAG ATAGGACCCG TGGTAGCAGC 3060 

* AACTGCTGCT GGCTGTTTCT GGAATATTAA ATGACAGTAA TCTATCAGGC CTGGGTGAGT 3120 

O AGCTAACAGG GGTGGGGGCG TGGTCTGGAA AACGCAGATA GGGTCATAGG AGCCACTGCA 3180 

S GCCTAGATTA CACCACTGGG TGTTCTGTCA CTAGGCCATT CTCACCAAGC AGTCCTCAGA 3240 

p ACTGGGAGCA CTGTTGCCAG CATTTAATGC CAGCATTTAA TGCCAGCATT AGGGGAGGCA 3300 

£ GAGGCAGAAG GATCTCTCTG AGTTCAAGGC CATCCTGAAT TTACATAAAG AGCTCCAGGC 3360 

" CAGCCAGGGT GCGCAGTAAA ACCTTGTCTC AAAAAACAAA GCATCTTTAG TGACCAGGCT 3420 

TGCTCCACCC CCAGTGACCA CGGACCCCCC ACCCGACGTG CACGTGAGCC GCGTTGGGGG 3480 

CCTGGAGGAC CAGCTGAGTG TGCGCTGGGT CTCACCACCA GCTCTCAAGG ATTTCCTCTT 3540 

CCAAGCCAAG TACCAGATCC GCTACCGCGT GGAGGACAGC GTGGACTGGA AGGTGCCCGT 3600 

CCCGCCCCGG ACCCGCCCCT GACCCCGCCC CCCGCATCTG ACTCCTCCCT CACCGTGCAG 3660 

GTGGTGGATG ACGTCAGCAA CCAGACCTCC TGCCGTCTCG CGGGCCTGAA GCCCGGCACC 3720 

GTTTACTTCG TCCAAGTGCG TTGTAACCCA TTCGGGATCT ATGGGTCGAA AAAGGCGGGA 3780 

ATCTGGAGCG AGTGGAGCCA CCCCACCGCT GCCTCCACCC CTCGAAGTGG TGAGCACCTC 3840 

TCCAGGGCTG GCTGGCCCAT GGAATCCCCA ATCCATCCTG TTCCTTCCCC CCCACCCTTT 3900 

TTTTGAGACA GCGTCTTCAG GTAGCGCATG CTGGCCTTAA ATTCAGTATG TAGTCAAGGA 3960 

TGACCTCGAG CTCCTGGTCT TTTTGTCTCC ACTTAGAGAC AATGGCCAGT GGCCATCACC 4020 

ACCTTTGGGA GACTAGCCAT GGAGTCTATT TAGCCTGTCA TTTQGTGACA GATGGAGTAC 4030 

AACAGTGTGA CCTCTTGTAA GAGAACTGAA GACAGGCTGT TTTTAACCCC AATATCCTAG 4140 

GCTCTCTAGA GGTTAACTTT ATATAAAATA GAGACTATTA CAGCCAGTTA TCACATGGTC " 4200 

CCACAGAACC. TTTTGTCACA CAACCTATAG ACCACAGTGC CTGTGCCTAC CACATAAGGG 4260 



p:\0PER\EWPO2146.U, 



r- 1 



88 



TCTCTACTGC 


TGGCCCACCC 


CTCCAACCCT TAAAAGGTAA 


CCTAGGCAGC 


CTTAATATTT 


4320 


GCAATCCTCC 


TACCTCAGCC 


TCTTGAATGC TCAGAAACCA 


GGCATTAACC 


CAAGTTTCTC 


4380 


TTCTCTGGGT 


CCCTTTCTTA 


AGGTGGGAGG GCCTAAAGAT 


GACTTCCTTT 


GTCCTGAAGA 


4440 


CTCTCCGAGC 


CCATGGATCT 


GCACTCTCTA ATATGAAATA 


TATTGCATAA 


AATGTCTGGC 


4500 


CTCAGTTTCC 


CCACCTGTCA 


GGTTTAGGCA GCACAGTCGG 


TCCAAGACAC 


TTCATTATTT 


4560 


GCAGGCAGTA 


TAAGAAGAAG 


CTCCCATCCC CCACCCGCTT 


CCTCCGGTCC 


CTAAGACAGA 


4620 


ATACTTCTAC 


ACTGAAACTG 


AACTCTCGCA GACGCATATG 


CTCACTTTAA 


TGATGATGAA 


4680 


ATAATGGGGA 


AACTGAGGCT 


CCGAGAGATT CCTGGAGGAA 


GAGGGTCAAA 


ACCAGCTCCA 


4740 


GGAAGCTCTC 


CAGCCCCCAT 


CCGGGCCTCT CCAGGTTCTG 


GGCTTGGCGG 


GAGTGAACAC 


4800 


AGCTGGGAGG 


GGCTGGAGCC 


TGGGAGCTTT GGCCCTTGCT 


CGTGCCCAGC 


ACCTGCGATT 


4860 


CTTGCACGGG 


AGCCAGCAGG 


CGGCTGCGTC CGCCCGAGAG 


ACTGAAGAAG 


CCGGGGGTAG 


4920 


GGTTGGAGGG 


AGGTAAGCAG 


GGGCTGTGGG GGCCGAAGCT 


TGTGCCAGGG 


CCTGTCAGCG 


4980 


AGTCCCCAGT 


TTTATTTATG 


GCGTGAGGCC GATGTCCTTA 


TCCGCTGGCC 


TGCTGGGGGA 


5040 


TGGCTGCGGC 


TGGGGATTGG 


ACCCAAGGGC TGGCTTCCCA 


CTCAGTCCTC 


CAGCCCACTC 


5100 


CATGTCACAC 


CCGTGCATTC 


TCTGAGGCTT ATCTTGGGAA 


CCCGCCCTTG 


TTCTGTGCTG 


5160 


TCTGTCTCTA 


TTTCTGTCAT 


TCACTTTCCC AGAGCCTTTT 


TTTTATGCTT 


TTAATATAAC 


5220 


TACGTTTTAA 


AAATTGCTTT 


TGTATAATGT GTGTGCCTTC 


GTGAGCGTGC 


GTGCCACAAC 


5280 


ACACACGTGA 


AGGTTAGAGA 


ACTTTGTTGA GTAGGCTCCT 


TCCACCATGT 


GGGACTAGGG 


5340 


CTGGCGACAA 


GAGCAATTAC 


TGACTCATCT CGCCAGCCCC 


TCACCCCTCA 


CTTCCCATCC 


5400 


TGTTTGGATA 


GTCATAGGTA 


ATCGAAGGTA AATCGCTGGC 


TTTAATTTCG 


TAGCTATCCT 


5460 


GCCTCAGCCT 


ACCAAGTGCT 


GTGCTACCAC GTTTGTGGGA 


GGGGCTCTCC 


TCCCAGTGTC 


5520 


TGGGGGTGAC 


ACAGTCCCAA 


GATCTCTGCT TTCTAGGTCT 


TTGTCTTAGT 


TTGCCCCTTG 


5580 


CTTTGTCCGT 


GTCCCTAGAG 


TCTCCGGCCC CACTTATCCA 


TTGACTGGTC 


TTTCCTTTAC 


5640 


CGAATACTCG 


GTTTTACCTC 


CCACTGATTT GACTCCCTCC 


TTTGCTTGTC 


TCCATCGCCG 


5700 


TGGCATTGCC 


ATTCCTCTGG 


GTGACTCTGG GTCCACACCT 


GACACCTTTC 


CCAACTTTCC 


5760 


CCAGCCGAAG 


CTGGTCTGGT 


ATGGGAGGCC GCCGTCCCGC 


CCGCGCCTCC 


TGCTGGCCGC 


5820 


GCCCCAACAC 


TGCCGCTCCA 


TTCTCTTTAG AGCGCCCGGG 


CCCGGGCGGC 


GGGGTGTGCG 


5880 


AGCCGCGGGG 


CGGCGAGCCC 


AGCTCGGGCC CGGTGCGGCG 


CGAGCTCAAG 


CAGTTCCTCG 


5940 


GCTGGCTCAA 


GAAGCACGCA 


TACTGCTCGA ACCTTAGTTT 


CCGCCTGTAC 


GACCAGTGGC 


6000 


GTGCTTGGAT 


GCAGAAGTCA 


CACAAGACCC GAAACCAGGT 


AGGAAAGTTG 


GGGGAGGCTT 


6060 


GCGTGGGGGG 


TAAAGGAGCA 


GAGGAAGAGA GAGACCCGGG 


TGAGCAGCCT 


CCACAACACC 


6120 


GCACTCTTCT 


TTCCAAGCAC 


AGGACGAGGG GATCCTGCCC 


TCGGGCAGAC 


GGGGTGCGGC 


6180 


GAGAGGTAAG 


GGGGTCTGGG 


TGAGTGGGGC CTACAGCAGT 


CTAGATGAGG 


CCCTTTCCCC 


6240 


TCCTTCGGTG 


TTGCTCAAAG 


GGATCTCTTA GTGCTCATTT 


CACCCACTGC 


AAAGAGCCCC 


6300 


AGGTTTTACT 


GCATCATCAA 


GTTGCTGAAG GGTCCAGGCT 


TAATGTGGCC 


TCTTTTCTGC 


6360 


CCTCAGGTCC 


TGCCGGCTAA 


AC TCTAAGG A TAGGCCATCC 


TCCTGCTGGG 


TCAGACCTGG 


6420 


AGGCTCACCT 


GAATTGGAGC 


CCCTCTGTAC CATCTGGGCA 


ACAAAGAAAC 


CTACCAGAGG 


6480 
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CTGGGCACAA TGAGCTCCCA CAACCACAGC TTTGGTCCAC ATGATGGTCA CACTTGGATA 6540 

TACCCCAGTG TGGGTAGGGT TGGGGTATTG CAGGGCCTCC CAAGAGTCTC TTTAAATAAA 6600 

TAAAGGAGTT GTTCAGGTCC CGATGGCCAG TGTGTTTGGG GCCTATGTGC TGGGGTGGGG 6660 



(2) INFORMATION FOR SEQ ID NO : 2 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Asp Pro Thr Leu Leu He Gly Ser Ser Leu Gin Ala Thr Cys Ser He 
15 10 15 

His Gly Asp Thr Pro Gly Ala Thr Ala Glu Gly Leu Tyr Trp Thr Phe 
20 25 30 

Asn Gly Arg Arg Leu Pro Ser Glu Leu Ser Arg Leu Leu Asn Thr Ser 
35 40 45 

Thr Leu Ala Leu Ala Leu Ala Asn Leu Asn Gly Ser Arg Gin Gin Ser 
50 55 60 

Gly Asp Asn Leu Val Cys His Ala Arg Asp Gly Ser He Leu Ala Gly 
65 70 75 60 

Ser Cys Leu Tyr Val Gly Leu Pro Pro Glu Lys Pro Phe Asn He Ser 
85 90 95 

Cys Trp Ser Arg Asn Met Lys Asp Leu Thr Cys Arg Trp Thr Pro Gly 
100 105 HO 

Ala His Gly Glu Thr Phe Leu His Thr Asn Tyr Ser Leu Lys Tyr Lys 
115 120 125 

Leu Arg Leu Val Arg Ser Gly * His Met * Gly Val Pro His Cys 
130 135 140 

G-y Pro Ser Leu Met Pro Tyr Pro Gin Gly Pro Gly Pro Leu His Ser 
145 150 155 160 

Leu * Asp Leu Gly Gly Ser His Gin Ser Pro Arg Leu Ser Lys He 
165 170 175 

* Cys Pro His Thr Gly Cys Pro Gly Arg 
180 185 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid - 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE. TYPE : DNA 



* 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
AGCTGGCGCG CCTCCCGGGC GGATCGGGAG CCCAC 35 

<2) INFORMATION FOR SEQ ID NO:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

2 6 

AGCTACGCGT TTAGAGTTTA GCCGGCAG 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: eingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Val Leu Ala Ser Ser Thr Thr Ser He His Thr Met Leu Leu Leu 
1 5 10 15 

Leu Leu Met Leu Phe His Leu Gly Leu Gin Ala Ser He Ser 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

lie Lye Pro Ser Gly Arg Arg Gly Ala Ala Arg Gly Pro Ala Gly Asp Tyr Lys Asp Mp 
5 10 i5 cyj 

Asp Asp Lys 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 base pairs . 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
' (D) TOPOLOGY: linear 
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<ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



GATCTTGCCC TCGGGCAGAC GGGGTGCGGC GAGAGGTCCT GCCGGCGACT ACAAGGACGA 
CGATGACAAG TAG 



(2) INFORMATION FOR SEQ ID NO: 35: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



AACGGGAGCC CGTCTGCCCC ACGCCGCTCT CCAGGACGGC CGCTGATGTT CCTGCTGCTA 
CTGTTCATCC TAG 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CCCACGCTTC TCATCGGATT CTCCCTG 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



■ CAGTCCACAC TGTCCTCCAC TCGGTAG * 



(2) INFORMATION FOR SEQ ID NO: 38: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11832 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

GCGGCCGCTG CAGTGATTAC TCACCGCGTG GCGCACCCCA CCCGCGGGCC GCTGAGTGGA 60 

TTTTTCCGTG GGGGGATGTG AAGAAGTTTA GGGAGAACTC TTCTGCACCG ATGGGAACTA 120 

GGAATGCAGG GTTCGGTCCC GTTCCCCAAA GGACACACCT CTCCCCATAA GCCCACTCAT 180 

AAGGGCTCCC TGCACGCGCT CCGGGACATC CCCATATCCA ATACCCGCAG ATATGATAGT 240 

TGAGAAGGGA CCAGAGGCCG GAGACTCCCT CCCTGCCTTC TGGCTTTCCC CCCCCCCTGC 300 

ACGAAACGAG ACTACAGCGA TGGGAGAGGT GGCATGAAGG CTTAGGGTGG GGATCGGTAG 3 60 

GACCCATGCA CCCAGAGAAA GGGACTGGTG GCAACTTTCA AACTCTCTGG GGAAGGAAGA 420 

j5i AGGGCTGAAA GAGGATGAAC GGGCTCAGGT ACTGCTCAAT GTGTGTGTGG CGGACCAAAG 480 

Q TGGGTATGGG GGCCCCGTAA GAGGGGCGGG GAAGGTGGAT AGGAAGGATC CCGGTAGACT 540 

* - 5 

GGAGGGGATC CTGGAAAAGC ACCAGGGCTG CGAGCTAGGA ACCCATTCGG AGTTAAGGGT 600 

(f\ ACAGGATCCC AGATGAGGGG GTGGGAAGCC TGGGACGGGC GGGACCAGAG AGGGAGGTCC 660 

LTI CACGGGCTGG TGGGGAAAGA GTGGGGGGCT TCGCGCAGGA GGATGGGACG TTCAGGAGTG 720 

^ GTAACTGGGC GGAGGCCGGC CGGGCGGGGC GCGCGGTGCC CGCGGGCGGT GGGAAGGCCG 780 

□ GTGCGGGGCC CACGATCAAC CCCCCCCCAG GGGCCGGGCC GGGCCGGGGG CGGGGCCGGG 840 

LU CGGGGCGAGC GGCGCATTAG CGCCTTGTCA ATTTCGQCTG CTCAGACTTG CTCCGGCCTT 900 

lT CGCTGTCCGC GCCCAGTGAC GCGCGTGAGG ACCCGAGCCC CAATCTGCAC CCCGCAGACT 960 

J3 CGCCCCCGCC CCATACCGGC GTTGCAGTCA CCGCCCGTTG CGCGCCACCC CCATGCCCGC 1020 

03 GGGTCGCCCG GGCCCCGTCG CCCAATCCGC GCGGCGGCCG CCGCGGCCGC TGTCCTCGCT 1080 

GTGGTCGCCT CTGTTGCTCT GTGTCCTCGG GGTGCCTCGG GGCGGATCGG GAGCCCGTGA 1140 

GTACCGTGCG CCCTGCTCCC CACCTCCCCA GGGAAGCCSG GATCCGGCGC CCCGGGGGGT 1200 

AGTCGCGGGG GATGGAAGAA GGGGCGCGAG CGCCACCTGG ACGTCCCGGG AACAAAGGAA 1260 

GGCGGCCCTC GGGGCGCCCT CACCTGTGGG GCTCATGGCA CCACCACCCA GCCTCCCAAG 1320 

AGTACCCCGT TATACATCAG AGGCCTCTTA TCTGTATCCC CTTTGCGAGG CTGTCTGGCC 1380 

AGGCTCAGTT TGAAGGACAT CGCAGTGTCC TGGGACCCCC CTCCTTCAGG GTGCTGGGAC 1440 

GCTTCGGGGC GCACGCCTGT GTCTTGGATA TCAGAGCGGA AGGGAAGCCT CCCTGGCCGG 1500 

GGGCGCACGC TTGGGTGCGT TGGGTTGGGT GCTGGCGCAA AGTGGGGTCC CCTCCCCCAT 1560 

GAAGTGATGA TCCCCGGGGG GAGGGTGGGG CGTTATCGTG AGCCCTCCTG TCCGCCTGGC 1620 

ATGCGGCCCG GCGTCCCTCG GGACTTGCCT CTCCGTGGGG TCGGCGCCGC CCCCTCCCCC 1680 

CTATAGCAGA CTCCATGCTT TGGTATCCTC GAAGTCCTCT CCACTGGTGG GGCTCACAAC 1740 

CGGTCTCATT CAGGCTGCGC TGGGTTGAGA GCCTCTAGCG ACTGAAATTT CGGTGAGGAG 1800 

CGAGAGCAAG CGTGTCCGGG CACCGCGAGC CCAGACTTCA TTGTCTAAGG GGCACCCAGT 1860 
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GGGGGTCAGC TGCCGAGAGA ATCCCACTOT CCCAGGAGGA ACTCCTGGCC TTGAGCCCCC 1920 

ATCACCCAAC GCACACATCC CCGCCAGGAT GCGGTCTCCA CATCCAGACC CTCTCTGGGA 1980 

CACACCCAAA GACACACAAA AGAGCCCCAC TGGCTTATGT CCCGTCACCC TGCCCTCCGA 2040 

CGCGCGCTGC AGCCCAGATG CGTATTCGCA CACCATCGCG GCGCTCGCAT TCCATCCTCT 2100 

ACACACACAC ACACACACAC ACACACACAC ACACACACAC ACACACAGAC ACGCACACAC 2160 

ACACGCACGC ACACACACGC ACGCCCGCAC TCGTGGTCCC ACATTTATTT CACAGGGGAG 2220 

GCAACACCGG GGTACGCATA TGGTTGAGTG CACTGGAGAT CTTTCCCCAC CACTCTCAGG 2280 

ACCCCATCCG GAGACACAGG CCACACCGCA GGGGCACCAC GCTGCGCTGC TGCTCTGGGC 2340 

TAGTAGTCTT GTGCAGTTTG TCCGCGGTGT CTGTGGACGC CCTCCCGCTC TTGTCAGGGG 2400 

ACAGGAACCT ACACTCCTGC TTGCCCAAGG CGGCTGGGCA GGTGATGTGG TGACACCCGG 2460 

GACCTTTCCG GGGAGTTGGT GTTGCTGCCA AGCCTGGGTA GTTTTTGAAT GCCACCAATA 2520 

GCGCTAAGCT TTGTTTCCGG GCGGGCTGCA GAGCAACAGG CGAAGGTGGC GGAGTGGGGG 2580 

TGGCGCGTGT GTTTTTTCTT TTAAGGGGGA GAGAAATTAA ATAAGAGGTT CTCACACCTC 2640 

TGCAATCTGT TTGTACTTAC CGTGTGTCTT AACACCTGAC CAGCCAGCCG GTGGGTCGTA 2700 

AAAGTGTATG CAGGTACCAG CGGGACAGGA GKTGGGGGCC CCTGGGGTAT GGCTGGGATG 2760 

GAGGCCACCT TCCCGTTGGC CTTTCAGGGA ATCTCACACT TTTCCCTTTT AAAACACATG 2820 

GTGTTCTTTT TAATAACGGC AGCAACTCCG CATTGGGAAA GGGGGAAATA AGCTTGTATA 2880 

GGCCCCGGCT TTGTGGAAAG GAGGGGAAGA GGGAAGAAAA AAGGAGGGGT GTCTCCTCCA 2940 

GGCTTAGGGG GCTGTCAGCT GCTGCTCTGT CTAGCTTGGC ATGTGTGTGC CCCAGTCCCC 3000 

AGTGGCTTTG GCCCATTGTT TGTGGAAGCC AAGAGGGAGA CTGGAGTCCT CTATCTCTGG 3060 

TACTCCAGAG TCAGGCTTCT CAGTCCGAGC CCAGAGAACG TCTTCCCTGT TTTATGGAGG 3120 

GAATCAGGGA AGGGGGTGCC AGGTGGACTA CGTTCTGCTG AGGACTGTAC CAGTCGCTCG 3180 

AAGGAGAAAG CTTGGGCTTG CCCCCCTCCC CCCTCAAGCC ACGAAGGGCA GCTGCTAGGC 3240 

TAGTGTGGTA AAAGGGCATT ACTCCCCAGC CAGGACCCCC CAGAGAGTCC CCTTCCTGGC 3300 

CAGACAAATG CTGGGGAGGG ACAGAGGGGT GTGATCATTG CCCAGGAGTG CAGACAGTGG 3360 

GGTCCCGGGT CGGGCAGTGC CTCCCACCCT GCTGAGGGGG GCGCCCAGGC AGGAAGCGGT 3420 

GGGTGGGCCG GGGTAGAGAC GCTGGCACGT CCCAGTTCAT GCCGAAGGAA TTCTGAATTA 3490 

GCGGGCGGCT GGCTGCCTGG GACCTCCGGG GCGGCCCCCT GGCCCCCGCC GCTCCGTCTG 3540 

GCCTGCTCCT CCTGCTCCTT CGCACGGACG CTGAGACCTC CGCTGAGCCC TGGGACAAGC 3600 

CCCAAATGCA ACTGCGATTG CAGGCTTCGC AAGACCCGCC TCCTCCCAAG GCCAAATTTG 3660 

CCTGGGAGAA GTCATTCAGG GCCCAGACTA GAACCATGTT GGTGCCACCT CATCCATCTG 3720 

GGGCATGAAG GACCGTCCAG GGCTGCAGTT TAGCTTCTTA ATAGGAACCT GGGGGTGGGT 3780 

GCAGCCTCTG TTCTCCGAGC CTCTTTGGAA ATCGGTTTTG TTTTTGTTTT TGTTTTTTCC 3840 

AATACTCTTT TCCTCTCATC CCATCCCGGG ACTGTTTTCC TCCCTAAGGG TTGAGAGCCC 3900 

TGCAGTCTTC CCTAACCTTT TCTTTGCTTC TACCCCAGGG CCTTTGCACA TGGAGTCCCA 3960 

CCTCTCCCCT TGCCCAACTG GGGCTCCAGC CTTACTGCAT TTGGCTCTTG GTAACTGTCC 4020 

CAGGGCCTCT CTGACACACA GGGTTGTAGC CCCAGCTCCC TCTCTTCTCC TCCCCCCTTT 4060 



# # 
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CTCTTTTGCT TCTGAGACTT AATTTTTTTC TTTTTCTTTT TGGCTTTTTG AGACAGGGTT 


4140 


TCTCTGTACA 


GCCCTGGCTG 


CCCTGGCACT 


CATTCTGTAG 


ACCAGGCTAG CCTCAAACTC 


4200 


ACAAACCTAC 


CTGCCTCTGC 


CTTTCCAGTG 


CTGGCACTAA 


AGATGTGGGC CACCACAACT 


4260 


AGTAGTTAAG 


TGTTTTGCTG 


TGTCTTTATT 


CCTATAGTGA 


CCTCAGTTCC TGGCATATTG 


4320 


TAGGCGATGG 


ATGGATGAAT 


GGATGGATGG 


ATGGATGGAT 


GGATGGTTGG ATGGAGCAAG 


4380 


CTTGAATCGT 


CCTGAGTGAA 


AAAAGAGACC 


TCAGAGAACT 


GAATGGAGTT AGGTTCCCAG 


4440 


GGCAGCCTGG 


CCTGCTGGTC 


TCATGGGAGC 


TCCCTGTGAA 


ACTTCCCCCA CACCTCCCAC 


4500 


CACCCTGCCA 


TCCTGTGTGG 


CTGACAAGAA AGGCCAATGG 


CCAGATGGGG ACACAGACTC 


4560 


AGGGAAGCTT 


GGAATATGTT 


CCCCTCCTCA TATCCTAGGC 


CTTGTTGTCC CCCTGAGGGC 


4620 


CCAGCCTATG 


AGTAGGGCAG 


CTGTGGGCTG 


CCCTAAGGTT 


GGGTAGGCAA GAAGGGGGTG 


4680 


GTCCCTCAGG 


GTGGGTCACA 


GGATTGAGGT 


CATTTCCAAA 


GTGGCCATCA CAGTGGCCCT 


4740 


AGGAAATGAT 


TGTGGAGAGT 


CAGAACTCCT 


GTTGGGAGTT 


GTAGAGGGCC TTGCATGTGG 


4800 


GCTTCTGTGG 


CTGTCCCTTC 


TCTTGTGGTC 


CTTTGCACAG 


TCCCCTCGTG TGTGCTGGGA 


4860 


TGTGAGGAGG 


GCACGGGGAA 


AATGAAGGCT 


CAGCCCCTCA 


GCTTGCCCTT CACGGTTCAC 


4920 


CCAACAGGGC 


TCACCTCTCC 


TCTGGACAGG 


CTCTCACTGT 


ATGCACAGAT TGGCCTCACA 


4980 


TTTGATTCCC 


TTCCTTTGG? 


CTCCTGGGAT 


GACAAACATT 


TACCAGGGTA GGATTTTACA 


5040 


TTTTAGATAT 


GTCCATTCTC 


CAGAAACACA 


CTTGTGAGGT 


TAGGGTATCA GTGAAAGGAC 


5100 


ACCACCAGGA 


CAGACAAAGA 


ATTGGAGAGG 


AAGGAAATTG 


GTAAGCCAGG CCATGCTTGA 


5160 


TCGCTTATGT 


GTAATCCCAG 


AACTCTGGAC 


GCTGAGGCAG 


GAGGATTCCA AGTTTCAAGA 


5220 


CAGTGTGTTC 


TAGGTAATGA 


GACCCTGTCA 


AGAAAAGAAA 


AGAAATAAAG AGACAAGAAA 


5230 


ATGTTTATAG 


GCTGTGAGAC 


AGCTTGGTGG 


GTAAGGGGCA 


CTTGCCTCCA ATCAAGATGA 


5340 


CCTCAGCCCC 


ATCCCTAGGA 


ATCCATGGTA 


GAAGGAGAAA 


GCAAACTCCA GCTGCTGACC 


. 5400 


TCCATACATG 


TGCTCCAATG 


TGCACACACA 


CAGGGAGACA 


TAATCAATTA ATAGGATGTA 


5460 


TTTGCTTAGA TTTGAGTAGG 


CATTTATGAC 


TGATGTTTTA 


AAATTTTTAT TTGATTTTAT 


5520 


GAAAATATAC 


CTGTTTGTAT 


TTGGTTTGGT 


TTGGTTTGAG 


TTTTGTTTAT TTGAGACAGG 


5580 


GCTTCTCTGT 


GTAGTCCTGG 


CTGTCCTTGG 


AACTCACTCT 


GTAGACCAGG CTGGCCTTGA 


5640 


ACTCAGAAAT 


CCGCCTGCTT 


GTGCTTCCCA 


AGTGCTTAGA 


TTAAAGGTGT GCACTGCCAT 


5700 


TCAGCAAAAT 


TGCATACTT? 


AACCCCAGTA 


TTTGGGAGGC 


AGAGGCAGAC TAATGTGTGA 


5760 


ATTCCAGGCT 


AGCCAAGGAT 


ACAGAGTGAG 


ACCCTATTCT 


TACCCTCCCC CCCCAAAACC 


5320 


CCAAAATGTA 


TTTTGTGCTT 


GTGTATGTAC 


ATGTGTGTTG 


CAGCACGTAA ATGTCCAAGG 


5880 


ACAACTTGTA GAAGTTCTC? 


CCGTTCACAG 


TCTAAGTCCT 


GAATTCAAAC TAAGGTCCTC 


5940 


AGGCTTAGCC 


ACAGTCTTCT 


TTATGTACTG 


AGCCATTTCA 


CTGGCCCTGG ATTGACTGAT 


6000 


GAATTAATTT 


TTGAGATAAG 


GTCTCTTGTA 


GCTCTAGCTA 


GGCTCAAACT ATGAACTCCC 


6060 


AAGGTCATCT 


TGAGCTGCTG 


GTACTCTTGC 


TTCCACCCCA 


AGTGGTGGAA TGATACTCAG 


6120 


GCAGCACTTC 


TCTGGGGAAG 


GGGCTGGCCT 


TGGCCTTGAT 


TTTGTTGCCT CAGCTTCAAT 


6180 


GAGTGCTTGG 


GTCTCGTTGT 


TTCTTTTCTT 


TATCTGTGAA 


ATGGGTGAAC ACCTGTTCAA 


6240 


GACTTCCTGA 


CTCTTGAAAC 


ATCCAGGCAG 


GGTGAGGGAC 


TTGAAGTGGG CTCATCCCAT 


6300 
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GCCTAACAAA 


GTGTCGTCTT TGACCCCAGA 


CACAGCTGTA 


ATCAGCCCCC 


AGGACCCCAC 


6360 


CCTTCTCATC 


GGCTCCTCCC TGCAAGCTAC 


CTGCTCTATA 


CATGGAGACA CACCTGGGGC 


6420 


CACCGCTGAG 


GGGCTCTACT GGACCTTCAA 


TGGTCGCCGC 


CTGCCCTCTG 


AGCTGTCCCG 


6480 


CCTCCTTAAC 


ACCTCCACCC TGGCCCTGGC 


CCTGGCTAAC 


CTTAATGGGT 


CCAGGCAGCA 


6540 


GTCAGGAGAC 


AATCTGGTGT GTCACGCCCG 


AGACGGCAGC 


ATTCTGGCTG 


GCTCCTGCCT 


6600 


CTATGTTGGC 


TGTAAGTGGG GCCCCAGACA 


CTCAGAGATA 


GATGGGGGTT 


GGCAATGACA 


6660 


GATTTAGAGC 


CTGOGTCTTC TGTCCTGGGG 


CAGAGCCATG 


GGCTCTCACT 


TGCATCCAGG 


6720 


CATGGTCATA 


CCCAGCACAG GCATTGCAAC 


TCTAGGGACA 


GCTGTGGCTG 


CACTGTCCCC 


6780 


TGTGTACCCC 


ACAGCTTTAG AAAAGCTGTC 


ATGTTTTCCT 


TGTAGTGCCC 


CCTGAGAAGC 


6840 


CCTTTAACAT 


CAGCTGCTGG TCCCGGAACA 


TGAAGGATCT 


CACGTGCCGC 


TGGACACCGG 


6900 


GTGCACACGG 


GGAGACATTC TTACATACCA 


ACTACTCCCT 


CAAGTACAAG* CTGAGGTTGG 


6960 


TACCCAGCCA 


AGCCTTGCTG TGTGACTTCT 


GGCAATACTT 


ACCTTCTCTG 


ATCAAATATG 


7020 


TTCCTGTTTA 


TGAACTCAAA AGGGACTCTC 


GCACCTCCAC 


AGGTGGTACG 


GTCAGGATAA 


7080 


CACATGTGAG 


GAGTACCACA CTGTGGGCCC 


TCACTCATGC 


CATATCCCCA 


AGGACCTGGC 


7140 


CCTCTTCACT 


CCCTATGAGA TCTGGGTGGA 


AGCCACCAAT 


CGCCTAGGCT 


CAGCAAGATC 


7200 


TGATGTCCTC 


ACACTGGATG TCCTGGACGT 


GGGTGAGCCC 


CCAGTGTCCA 


CCTGTGTTCT 


7260 


GCCCTAGACC 


TTATAGGGCG CCTCCCCCCC 


ATCCCCCCAG 


ACTTTTTGGT 


TCTTCTAGAG 


7320 


GTCTTAGCCA 


CAGCCACGGT GGTTGCAGGA 


CAGTGGTTGT 


TCATAACTTA 


ATGCAAAGAC 


7380 


TTTCCCCCAA 


GACAGTCAAG ATTTTCCCCT 


CCCCACCCCC 


AACACACACA 


TACACACACA 


7440 


CTCTGCAGAG 


AACACCTGGC CTGACCACCC 


TCCCTCTCTA 


CAGCCCAGGT 


GTTCAGAAGG 


7500 


GAGTCCTAGG 


GGACTGAGAG GAGGCGCCCA 


GGTCTGAAGG 


CGCCCCAGGA 


AGCCGAGGCC 


7560 


TTGAGCTGGG 


GGGGGGGGCG AGGGTTGGAG 


GCACGAACTG 


GATGATCCCT 


GAGCACAACT 


7620 


GGGCCTAATC 


TAATTAGGGT GTTCCCAGCC 


CAAAGCAGCC 


TGGGCCATTT 


AACCCTTCAA 


7680 


GTGCCTCACT 


GAAGACTCAG GGGAGAGATC 


AGCTTGTACT 


CTCTCCATGG 


TCCCCCAGGA 


7740 


GGGTTCCTGG 


GTGCCCCTGG CTCATTCCCA 


CATCCAGAGG 


TTTTGTGTCT 


TCCTGGCATC 


7800 


TAACCCTCAG 


TTGTGCTCTG TGGCTGGCAC 


AGCTGCCCCG 


TGGAGGCTCT 


TGGTAATGTA 


7860 


CAAGGCATCA 


GAGGTGGACA TGGGATGGGG 


ATACATAGGG 


ATGGAGCCAA 


ATAGCACCTC 


7920 


AAGGTGGGGT 


GATATACAAT AAAGCTTGTC 


ACCCTGACGC 


TCAGAAAGCC 


TACTCATGAT 


7980 


GATCACAATT 


GTTGACATCA CTCTGGGACA 


TGTAGTGAGA 


CCCTAGCTCA 


AAACACAGAC 


8040 


AGTAGCTTTA 


AGAGTCAGCT TGTGACTTAA 


TACTGGAACT 


CAGGGCCTAA 


TAGGTGCTGG 


8100 


GTGATGCTCG 


CCTCACTCCC TGTTTAGTGA 


GATCTCTGCG 


CTAATCTCCA 


CCCCAGCTGG 


8160 


GTGGGCTGCT 


CTGTCCCCTT GAGGGCAGGA 


ATGTGTGTCT 


TCCATCAGAG 


ATAGGACCCG 


8220 


TGGTAGCAGC 


AACTGCTGCT GGCTGTTTCT 


GGAATATTAA 


ATGACAGTAA 


TCTATCAGGC 


8280 


CTGGGTGAGT 


AGCTAACAGG GG7GGGGGCG 


TGGTCTGGAA 


AACGCAGATA 


GGGTCATAGG 


8340 


AGCCACTGCA 


GCCTAGATTA CACCACTGGG 


TGTTCTGTCA 


CTAGGCCATT 


CTCACCAAGC 


8400 


AGTCCTCAGA 


ACTGGGAGCA CTGTTGCCAG 


CATTTAATGC 


CAGCATTTAA 


TGCCAGCATT 


3460 


AGGGGAGGCA 


GAGGCAGAAG GATCTCTCTG 


AGTTCAAGGC 


CATCCTGAAT 


TTACATAAAG 


8520 
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AGCTCCAGGC CAGCCAGGGT GCGCAGTAAA ACCTTGTCTC AAAAAACAAA GCATCTTTAG 8580 

TGACCAGGCT TGCTCCACCC CCAGTGACCA CGGACCCCCC ACCCGACGTG CACGTGAGCC 8640 

GCGTTGGGGG CCTGGAGGAC CAGCTGAGTG TGCGCTGGGT CTCACCACCA GCTCTCAAGG 9700 

ATTTCCTCTT CCAAGCCAAG TACCAGATCC GCTACCGCGT GGAGGACAGC GTGGACTGGA 87 60 

AGGTGCCCGT CCCGCCCCGG ACCCGCCCCT GACCCCGCCC CCCGCATCTG ACTCCTCCCT 8820 

CACCGTGCAG GTGGTGGATG ACGTCAGCAA CCAGACCTCC TGCCGTCTCG CGGGCCTGAA 8880 

GCCCGGCACC GTTTACTTCG TCCAAGTGCG TTGTAACCCA TTCGGGATCT ATGGGTCGAA 8940 

AAAGGCGGGA ATCTGGAGCG AGTGGAGCCA CCCCACCGCT GCCTCCACCC CTCGAAGTGG 9000 

TGAGCACCTC TCCAGGGCTG GCTGGCCCAT GGAATCCCCA ATCCATCCTG TTCCTTCCCC 9060 

CCCACCCTTT TTTTGAGACA GCGTCTTCAG GTAGCGCATG CTGGCCTTAA ATTCAGTATG 9120 

TAGTCAAGGA TGACCTCGAG CTCCTGGTCT TTTTGTCTCC ACTTAGAGAC AATGGCCAGT 9180 

GGCCATCACC ACCTTTGGGA GACTAGCCAT GGAGTCTATT TAGCCTGTCA TTTGGTGACA 9240 

GATGGAGTAC AACAGTGTGA CCTCTTGTAA GAGAACTGAA GACAGGCTGT TTTTAACCCC 9300 

AATATCCTAG GCTCTCTAGA GGTTAACTTT ATATAAAATA GAGACTATTA CAGCCAGTTA 9360 

TCACATGGTC CCACAGAACC TTTTGTCACA CAACCTATAG ACCACAGTGC CTGTGCCTAC 9420 

CACATAAGGG TCTCTACTGC TGGCCCACCC CTCCAACCCT TAAAAGGTAA CCTAGGCAGC 9480 

CTTAATATTT GCAATCCTCC TACCTCAGCC TCTTGAATGC TCAGAAACCA GGCATTAACC 9540 

CAAGTTTCTC TTCTCTGGGT CCCTTTCTTA AGGTGGGAGG GCCTAAAGAT GACTTCCTTT 9600 

GTCCTGAAGA CTCTCCGAGC CCATGGATCT GCACTCTCTA ATATGAAATA TATTGCATAA 9660 

AATGTCTGGC CTCAGTTTCC CCACCTGTCA GGTTTAGGCA GCACAGTCGG TCCAAGACAC 9720 

TTCATTATTT GCAGGCAGTA TAAGAAGAAG CTCCCATCCC CCACCCGCTT CCTCCGGTCC 9780 

CTAAGACAGA ATACTTCTAC ACTGAAACTG AACTCTCGCA GACGCATATG CTCACTTTAA 9840 

TGATGATGAA ATAATGGGGA AACTGAGGCT CCGAGAGATT CCTGGAGGAA GAGGGTCAAA 9900 

ACCAGCTCCA GGAAGCTCTC CAGCCCCCAT CCGGGCCTCT CCAGGTTCTG GGCTTGGCGG 9960 

GAGTGAACAC AGGTGGGAGG GGCTGGAGCC TGGGAGCTTT GGCCCTTGCT CGTGCCCAGC 10020 

ACCTGCGATT CTTGCACGGG AGCCAGCAGG CGGCTGCGTC CGCCCGAGAG ACTGAAGAAG 10080 

CCGGGGGTAG GGTTGGAGGG AGGTAAGCAG GGGCTGTGGG GGCCGAAGCT TGTGCCAGGG 10140 

CCTGTCAGCG AGTCCCCAGT TTTATTTATG GCGTGAGGCC GATGTCCTTA TCCGCTGGCC 10200 

TGCTGGGGGA TGGCTGCGGC TGGGGATTGG ACCCAAGGGC TGGCTTCCCA CTCAGTCCTC 10260 

CAGCCCACTC CATGTCACAC CCGTGCATTC TCTGAGGCTT ATCTTGGGAA CCCGCCCTTG 10320 

TTCTGTGCTG TCTGTCTCTA TTTCTGTCAT TCACTTTCCC AGAGCCTTTT TTTTATGCTT 10380 

TTAATATAAC TACGTTTTAA AAATTGCTTT TGTATAATGT GTGTGCCTTC GTGAGCGTGC 10440 

GTGCCACAAC ACACACGTGA AGGTTAGAGA ACTTTGTTGA GTAGGCTCCT TCCACCATGT 10500 

GGGACTAGGG CTGGCGACAA GAGCAATTAC TGAGTCATCT CGCCAGCCCC TCACCCCTCA 10560 

CTTCCCATCC TGTTTCGATA GTCATAGGTA ATCGAAGGTA AATCGCTGGC TTTAATTTCG 10620 

TAGC'TATCCT GCCTCAGCCT ACCAAGTGCT GTGCTACCAC GTTTGTGGGA GGGGCTCTCC 10680 

TCCCAGTGTC TGGGGGTACA CAGTCCCAAG ATCTCTGCTT TCTAGGTCTT TGTCTTAGTT 10740 
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TGCCCCTTGC TTTGTCCGTG TCCCTAGAGT CTCCGGCCCC ACTTAGTCTC CATTGATTTC 10800 

CTTTCTGACC GAATACTCGG TTTTACCTCC CACTGATTTG ACTCCCTCCT TTGCTTGTCT 10860 

CCATCGCCGT GGCATTGCCA TTCCTCTGGG TGACTCTGGG TCCACACCTG ACACCTTTCC 10920 

CAACTTTCCC CAGCCGAAGC TGGTCTGGTA TGGGAGGCCG CCGTCCCGCG CGCGCCTCCT 10980 

GCTGGCCGCG CCCCAACACT GCCGCTCCAT TCTCTTTAGA GCGCCCGGGC CCGGGCGGCG 11040 

GGGTGTGCGA GCCGCGGGGC GGCGAGCCCA GCTCGGGCCC GGTGCGGCGC GAGCTCAAGC 11100 

AGTXCCTCGG CTGGCTCAAG AAGCACGCAT ACTGC TCGAA CCTTAGTTTC CGCCTGTACG 11160 

ACCAGTGGCG TGCTTGGATG CAGAAGTCAC ACAAGACCCG AAACCAGGTA GGAAAGTTGG 11220 

GGGAGGCTTG CGTGGGGGGT AAAGGAGCAG AGGAAGAGAG AGACCCGGGT GAGCAGCCTC 11280 

CACAACACCG CACTCTTCTT TCCAAGCACA GGACGAGGGG ATCCTGCCCT CGGGCAGACG 11340 

GGGTGCGGCG AGAGGTAAGG GGGTCTGGGT GAGTGGGGCC TACAGCAGTC TAGATGAGGC 11400 

CCTTTCCCCT CCTTCGGTGT TGCTCAAAGG GATCTCTTAG TGCTCATTTC ACCCACTGCA 11460 

AAGAGCCCCA GGTTTTACTG CATCATCAAG TTGCTGAAGG GTCCAGGCTT AATGTGGCCT 11520 

CTTTTCTGCC CTCAGGTCCT GCCGGCTAAA CTCTAAGGAT AGGCCATCCT CCTGCTGGGT 11580 

CAGACCTGGA GGCTCACCTG AATTGGAGCC CCTCTGTACC ATCTGGGCAA CAAAGAAACC 11640 

TACCAGAGGC TGGGCACAAT GAGCTCCCAC AACCACAGCT TTGGTCCACA TGATGGTCAC 11700 

ACTTGGATAT ACCCCAGTGT GGGTAGGGTT GGGGTATTGC AGGGCCTCCC AAGAGTCTCT 11760 

TTAAATAAAT AAAGGAGTTG TTCAGGTCCC GATGGCCAGT GTGTTTGGGG CCTATGTGCT 11820 

GGGGTGGGGG GA 11832 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acids 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 



Val He Ssr Pro Gin Asp Pro Thr Leu Leu He Gly Ser Ser Leu Gin Ala Thr Cys Ser 
S 10 15 20 

He His Gly Asp Thr Pro 
25 
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(2) INFORMATION FOR SEQ ID No/:40: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: i^gle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(xi) SEQUENCE DESCRIPTION 



GTCCAAGTGC GTTGTAACCC A 



(2) INFORMATION FOR SEQ ID NO 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic a$id 

(C) STRANDEDNESS : S 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION 



GCTGAGTGTG CGCTGGGTCT CACC 



(2) INFORMATION FOR SEQ ID NO 



CHARACTER I SI ICS 



(i) SEQUENCE 

(A) LENGTH: 18 base 

(B) TYPE: nucleic 

(C) STRANDEDNESS: 6 

(D) TOPOLOGY; lineai 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION 
GGCTCCACTC GCTCCAGA 



SEQ ID NO: 40 



41: 



ngle 



SEQ ID NO: 41 



42: 



pairs 
e 



acid 
ingl« 



SEQ ID NO: 42: 



